General Disclaimer 


One or more of the Following Statements may affect this Document 


• This document has been reproduced from the best copy furnished by the 
organizational source. It is being released in the interest of making available as 
much information as possible. 


• This document may contain data, which exceeds the sheet parameters. It was 
furnished in this condition by the organizational source and is the best copy 
available. 


• This document may contain tone-on-tone or color graphs, charts and/or pictures, 
which have been reproduced in black and white. 


• This document is paginated as submitted by the original source. 


• Portions of this document are not fully legible due to the historical nature of some 
of the material. However, it is the best reproduction available from the original 
submission. 


Produced by the NASA Center for Aerospace Information (CASI) 



(NASh-CR-150059) CONTROL BY 

MODEL 

ERROR 

N77-11755 

E3TIHATION (California Univ. ) 

231 

P 


HC All/MF A01 


CSCL 12B 

Unclas 

L 


G3/63 

07951 ^ 





UCLA-ENG-7630 
MARCH 1976 


R.E. SKELTON 


CONTROL BY MODEL ERROR ESTIMATION 


oPrincipdl Investigator: P.W. LIKINS 






received 

NASA sn FACiUTf^ 
INf UT BRANCH 



UCLA-ENG-7630 
MARCH 1976 


CONTROL BY MODEL ERROR ESTIMATION 


Robert Eugene Skelton 


Principal Investigator; Peter W. Likins 


Prepared for George C. Marshall Space Flight Center 
Marshall Space Flight Center, Alabama 35812 
Under Contract No. NAS8-28358, Mod 12 
Contract Type; CR 


School of Engineering and Applied Science 
University of California 
Los Angeles, California 



ABSTRACT 


One of the most critical deficiencies of modern control 
theory is the reliance of the theory upon the fidelity of the 
mathematical model of the system. This research approaches the 
problem of model errors in linear system models by augmenting to 
the original system of equations an "error system" 

e = Py 
Y = DY 


( 1 ) 


which is designed to approximate the effects of such model errors 
as external disturbances, truncated modes, and parameter errors, 
when this "error vector" e is added to the system equations to 
obtain 

X s: Ax + Bu + e (2) 

Several researchers have concluded that increasing the order of the 

model > (via augmented "disturbance models," "dynamic compensators," 

etc.) can compensate for parameter errors,^ and external 
2“-6 

disturbances. There is very little guidance available, 

however, for the task of actually constructing such an auxiliary 

system for augmentation, or for determining whether such methods 

can also accommodate truncated modes, a problem which has in the 

7-9 

past been treated by singular perturbation. 

The point of view taken in this research is that the total of 
truncated modes, external disturbances, and parameter errors can 
be corrected by the addition of a "model error vector," e, to the 
original system equations. When state estimation techniques are 
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applied to the assumed "error system" (1) , and when the error 

system (1) belongs to a class called the Sturm-Llouvil'le equations 

(which generate orthogonal functions), the resulting estimator is 

labeled an "orthogonal filter." A special case of the Sturm- 

Liouville equations called the Chebyshev error system generates the 

Chebyshev polynomials, which provide a least squares fit to the 

measurement residual over an interval x. The important selection 

of this "observation window", x, is related to a property of the 

closed loop system called the characteristic time (x ^ max , 

X -v(x) 

where v(x) is a Liapunov function for the asymptotically 
stable linear system) , Thus the parameter x is both a model param- 
eter and an element of the control system serving to interweave the 
processes of model development and control system design. 

In this dissertation a Chebyshev error system is developed for 
application to the Large Space Telescope (LST) , an earth satellite 
which NASA plans to launch in the 1980' s. 
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1.0 Introduction 


The task of determining a control policy so that the physical 
dynamical system behaves in a desired manner usually begins with the 
responsibility of obtaining a mathematical characterization (model) of 
the physical system. However, any mathematical description of the phys- 
ical system is subject to "model errors" in the sense that the mathe- 
matical description does not precisely describe the dynamical behavior 
of the physical system. Moreover, there presently is no well developed 
theory to assist one in selecting a mathematical model of the physical 
system which is appropriate for the particular control task at hand. 

It is for this reason that model errors are a common cause of diver- 

* 

gence in the state estimators used in the feedback control policies. 

For linear models of systems and under other appropriate conditions, 
such state estimators are commonly called "Kalman filters" in stochastic 
characterizations of problems and "Luenberger observers" in deter- 
ministic characterizations of problems. It is understood by users of 
the theory that model errors establish performance limitations of the 
closed loop physical system, and thus the realized performance may be 
unacceptable even though the design is based upon optimal control theory. 

The reason this model error problem persists in spite of the vol- 
ume of "model reduction" literature that exists (see for example^^ 
lies in the fact that the appropriateness of a model depends upon the 

* 

Model errors also limit performance of classical designs. The modern 
"state estimator" approach is utilized throughout this paper because 
its structure includes classical designs but is not limited to them. 
That iS; a classical compensation design may be viewed a state esti- 
mator for some model of the system. 
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nature of its (desired or undesired) Inputs, As a consequence of this 
fact, the '•modeling problem" and the "control problem" should not be 
treated as separate and independent problems. Efforts to treat them 
separately usually result in control policies which either 

(a) suffer from modeling errors, causing performance requirements 
to be violated 

or 

(b) satisfy performance requirements at a high cost of control 
policy synthesis. 

The conditions which cause (a) are sometimes not discovered before 
"flight" of the system. The condition (b) often prevents use of modern 
control theory in an application. To more sharply focus questions of 
model errors, attention is restricted to linearized models of the 
physical system. 

In the quest for an appropriate system model (call it It is 

necessary to consider the impact of the choice of models on the total 
closed loop system performance. It is helpful to consider the exist- 
ence of a hierarchy of linear models as shown in Figure 1-1. In this 
hierarchy, it is assumed that the physical quantity to be manip\ilated 
for control purposes has been selected (corresponding to its mathe- 
matical symbol, u°(t))* Likewise, the mathematical symbol selected for 
the measurement vector is z°(t), corresponding to the selected physical 
quantities. The output vector, y°(t), represents those variables we 
actually wish to control, as opposed to the variables which are actually 
measured, z -(t). To complete the model, then, we must write some dif- 
ferential equations which dynamically connect the systems input 
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Figure 1-1. Hierarchy of Linear Models. 
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(desired and undesired) to the system outputs. The order of these 
equations (or, the number of equations in the state form of Figure 1-1) 
remains to be determined; thus the hierarchy of possibilities illus- 
trated in Figure 1-1. 

The dilemma of model selection is complicated by two conflicting 
goals. We wish to implement the simplest solution to the control prob- 
lem, for economic reasons, and we wish to achieve satisfactory perfoirm- 
ance. If the models of Figure 1-1 are arranged in order of decreasing 
model fidelity* (fidelity being defined according to their ability to 
faithfully predict the output of the physical system, y°(t) for a 
specified input and initial conditions), then the better closed loop 
system performance is achieved by designing the control policy on the 
basis of a model at the upper end of this model hierarchy, (since more 
information about the physical system is made available) whereas the 
simplest control policy obtains from the lower end of the model hier- 
archy. The limits of this argument lead us to the conclusion that truly 
optimal control decisions (policies) can come only from models which 
are not finite in size (again assuming there is some small region 
around the origin of state space in which linearized models are valid), 
alternately, that the simplest feedback control policies require no 
model at all (corresponding to open loop control) . This is an easy 

The actual arrangement of a set of models in order of their fidelity 
would be a moat difficult task in itself owing to the fact that as the 
models become larger (i.e., as more modes of the system are included) 
more uncertainty is associated with the parameters. 

The existence of the hierarchy of models in Figure 1-1 has been argued. 
However, the actual construction of the "arbitrarily good" model 
mentioned above is impossible, for obvious reasons. 
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way to visualize the fact, well known to users of optimal control 
theory, that optimal solutions are only as good as the models upon 
which they are based. Thus, in solving optimal control problems there 
is always the (often unstated) economic constraint imposed on model 
size. In problems of analysis this constraint may be imposed by the 
capacity or cost of available computers. In problems of feedback con- 
trol design this constraint may be imposed by the cost of synthesizing 
the control policy. The model size versus controlled system perform- 
ance dilemma leads us to the characterization in section 2.0 of a 
problem we would like to solve (but cannot) called the ''Minimal Con- 
troller Problem." Because the Minimal Controller Problem cannot be 
solved, it is typical to employ various schemes for "discrediting" the 
model (upon which control design is based) and these past approaches 
are discussed in section 3.0. 


The approach taken in this research begins in section 4.0 and 
centers around the question, "Can an n^ vector, e(t), (called a model 
error vector) be found such that when e(t) is added to the state equa- 
tions of any model jJ. ^ (see Figure 1-1), the output of the model is 
caused to be identically equal to the output of a better model, 

Pursuit of this question ( and the adding of precision to it) leads one 
to describe model error effects in four categories: 


1, truncated modes 


2. parameter errors 

3. neglected external disturbances 

4. neglected nonlinearities. 
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Characterizing ah error vector e(t) which compensates for the above 
effects Is not a straightforward task because one does not usually 
know, prior to design of the control policy, what disturbing effects 
(from external disturbances and truncated modes) are important to keep 
in the model. The control designer must typically rely upon perform- 
ance evaluations In a trial and error fashion to determine what 
disturbance effects and modes of the dynamical system should be 
actively controlled, and therefore be Included In the model upon which 
the control law is based. Also parameter uncertainties (whether 
constant are due to ''In-fllght" changes) and variations in the 
disturbance environment cannot usually be reliably predicted. It is 
for these several reasons that the appropriate model error vector, 
e(t) , cannot be rationally specified prior to control system design. 
Therefore, the modeling problem Is proposed as a two-phase task, as 
shown in Figure 1-2. 

STEP Coarse model reduction ->■ Figure 1-2 describes 

a reduction from a conservative model, to a reduced model 4 ^^ and 

is accomplished prior to control design) . 

STEP II. Vernier model adjustment ^ "*^2 Figure 1-2 

describes a small Increase in the order of the model by the augmenta- 
tion of an "error system" to the model and must be accomplished 
during the control design task. It will be shown that control designs 
based upon model are less sensitive to modeling errors than designs 
based upon model dy 

This research is more concerned with a theory of modeling of 
linear systems than with any new theories of control, although from 
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prior discussions we know that the two cannot be considered entirely 
separately. Therefore* section 4.0 will discuss the idea of control 
by model error estimation when the control problem of Interest Is the 
linear regulator problem of optimal control. In section 5.0 that 
particular choice for the model "error system" is made such that the 
model error vector considered evolves from a set of orthogonal functions 
which are considered to span the space of anticipated errors. A brief 
outline of the design procedure is giv*^n in section 5.0. This method 
of control design is applied to the Large Space Telescope (LST) which 
the National Aeronautics and Space Administration (NASA) plans to 
orbit in the 1980' s. 

The relationships of various aspects of this work with existing 
theories and practices are cited throughout the text, but a summari- 
zing discussion of this sort Is given in section 7.0, along with con- 
clusions and suggestions for further research. 
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2.0 


Some Effects of Model Errors in Control Problems 


As modern control theory rapidly developed in the '60’s the 
press of the control theorists to alleviate system performance limi- 
tations Imposed by the control law (which he might have viewed as his 
responsibility) exposed, sometimes dramatically, the effects of errors 
in the system model (which he might have viewed as the dynaraicist’s 
responsibility) . Such model error effects were not always discovered 
before "flight” of the system. Since failures of modern control 
Implementations do not usually make the journals one can be mislead, 
on the basis of the literature, about the frequency of successes of 
modern control theory applications. Optimal control theory "squeezes" 
the models hard, and a theory that tells how to get more out of the 
system ( model) puts greater pressure on the modeling process. Thus, 
the practicing engineer quickly learns that "optimal" solutions are 
only as good as the model upon which the solutions are based. 
Furthermore, since models for physical dynamical systems are always 
approximations, we must conclude that truly "optimal" decisions can 
only come from truly "best" models (since there always exists a 
better model, here "best" at least implies not finite dimensional). 
Therefore, practical questions of optimality carry an often unstated 
consideration of implementation constraints. This leads us to the 
notion of the problem we would most like to solve (but cannot) . 


In the mathematical sense the word "optimal" always means with respect 
to the given model and with respect to the chosen performance criteria . 
Poor system performance can also be attributed to a poor choice of 
performance criteria rather than inadequate modeling. 
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The Minimal Controller Problem; 

Find a control policy of minimal complexity which allows the 
required performance to be guaranteed. That is, find the control 
policy u° « u°(z^, A^, , M^) which although based upon real 

measurements z° and the model, 4^^ (of the physical system, jJq)^ 
satisfies the performance requirement 


4 ^ 0 ly*^! Iq + I |u°(z®,4^^) 1 lj^)dt i v° 

with minimum n^ , where V° is a specified number and where 


A A physical system 
0 ~ 


= A^x^ + B^u°l 


J - 

2 ^ 


u : actual inputs CR 

o - 

y : actual outputs CR 






m 


actual measurements CR 

X" ; ■ 


I 


• state C R ^ 


u 


control C R 


m 


4 ^ 

y "* ; output C R 
z^ ; measurement C R' 


£ 


Note that z is a real measurement and cannot be predicted with 
certainty. Now therefore the Minimal Controller Problem is not a 
mathematical problem (due to the requirement of certainty, V ) 

and cannot be solved mathematically, and for that matter we have no 
procedure to solve it even empirically. However one might attempt to 
solve the minimal controller problem in the following way. Presume 
that the output variables y° which we desire to control have been 
identified and that the "controller," u°(4<£j, z°) , is composed of the 


The problem cannot be solved even in the statistical sense; see 
section 3 .Q. 
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o T 

control law u = Gx arid the state estimator 
A i i i - ''1 i o o * 

a. Choose the decision variables, u° (with due regard to the fact 
that the model presumes that a°(t) can be instantaneously changed. 
In practice any delays in changing u must be small with respect to 
the time constants associated with both the open loop and closed loop 
system of model Jid^ . 

b. Choose a model order, n, (for the sake of this discussion 

J 

begin with ^1). 

c. Choose model parameters (A^ , * M^) (Note that this 

involves, in (A'^ , M^), decisions of what measurements to make and in 
(A'^ , B^) decisions of where to apply the controls). 

d. Choose a controller (that is, choose a functional relationship 
between the measurements, z°(t), and the control issued, u°(t)), 

e. Evaluate and test the inequality ^ V°. (If yes stop. 

If no, return to c. or d. After exhausting "all” possibilities in c. 
and d., return to b. increasing by 1. Continue until e. is 
satisfied.) 

The statement of the minimal controller problem serves only to remind 

us of desired objectives and is not in itself constructive. Indeed, 

it is because we cannot solve the Minimal Controller Problem that we 

are forced to pay close attention to model errors. 

Before considering the effects of model errors we should define 

the terms. Errors in linear models may result from; 

The details of estimator (full order and. reduced) design will be 
discussed in sections 3.0 and 4k0. 
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1. Truncated modes (those errors Introduced by model reduction, 
recognizing that any mathematical model may be considered a reduction 
from a better one; thus this effect is always present in models of 
physical dynamical systems.) 

2. External disturbances (those unmodeled effects which are 
independent of the state variables used to describe the model) . 

3. Parameter errors (even if the stmcture (order) of the model 
is appropriate, there may exist better values for the parameters). 

4. Effects of linearization (neglected nonlinearities). 

Now, we wish to know now what price we pay for the presence of such 
modeling errors. 

2.1 Some Effects of Model Errors in Deterministic Characterizations 
of Dynamical Systems. 

Suppose we wish to minimize the functional 


J(u°) ~ \ (y^ Qy^ + Ru°)dt + -y y^(T)^ 11 y^ 


(T) 


( 2 . 1 ) 


subject to the time- invariant linear system model, 



i? = + B^u° 




i "i 
G R ^ 


y^ C R^ 


x(t ) = X 
o o 


( 2 . 2 ) 
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Furthermore, let the pair (A^ , B^) be controllable and (A'^jC^) 
be observable in the sense of Kalman (also, let Q, R, be positive 
definite matrices). Then, according to this model, the closed loop 
optimal system obeys 

= A (2.3) 

where K is the symmetric, positive definite solution of the matrix 
Riccati equation 

T .T 

K = - K A^- A^ K + K R~^ fif K - Q (2.4) 

K(T) = H 

Now A has an arbitrary spectrum. That is, there exist real values for 

— 13 

Q and R such that the eigenvalues of A may be arbitrarily specified. 
Then, on the basis of the analysis of this system model, arbitrarily 
good performance can be predicted. That is, there exist Q, R, such 
that 

J(u°) < 0 Ve S 0 (2.5) 


= [a^ - 


b\"V 


To synthesize (operate the real system) with this ’’optimal" control 

T 

o i i 

policy, u = - R B^ K x“^ , a "separation principle" is usually invoked, 
as follows. The control law is determined (above) as if the state 


n .-1 
3 




*The pair (A^ , B'^) is controllable if rank [B^ , A^B^ . . . A^ B^]-n . 

, n.“l 

,T ,T .T .T ^ ,T 

The pair (A^,C/^) is observable if rank [C^ ,A^ , . . . ,A^ * 

In this report the word "controllable" will always mean v/ith respect 

to the state vector (that is (A^,B^), satisfy above condition.) Output 

n, - 

. . •• A * • 3 “^ ’A 

controllability (rank [C^B^, C^aV , . „ . ,C^A^ ] [G^B^, 


, . . . ,C'^A'^ B-J ] = ^j) is assumed throughout. 


. ."j-1 


^ 3 ^ = 


13 



were available for measurement. A state estimator is then constructed 


to asymptotically reconstruct . The estimate of , labeled x^ , is 
then used to synthesize the feedback control solution, 
u° = - R Kx^ . The justification offered for this approximate 
implementation of the optimal solution is that the state estimator (or 
state ''observer'’ as the literature on deterministic problems prefers) 
can, under certain conditions, (observability of the pair (A^,m'^)) 
be made arbitrarily ''fast" so that errors between the state, , and 
the estimate of the state, x^ , are arbitrarily small after an arbi- 
trarily short period of time. Thus, the synthesized "controller" is 
as shown in Figure 2-1, where it is noted that the estimator has an 

arbitrary spectrum (eigenvalues of (A*^ - K M'^) may be arbitrarily 

. . * 1 

*1 T tQ 

specified) by virtue of observability of A , M . The full n -order 

form of the state estimator is shovm in Figure 2-1 instead of the 

reduced, -order form, v/hich might be implemented in practice, 

to illustrate more clearly the effects of model errors. If the control 

gains G are taken as those optimal for the model xi. then 
_1 .T ^ 

G = - R K, see (2.1) - (2.4). Now the problem that sometimes 

occurs V7ith the control scheme of Figure 2-1 is that in spite of the 
fact that arbitrarily good performance can be predicted on the basis 
of the model , the physical system ^ can deliver arbitrarily bad 
performance (i.e<, y diverges from desired value y -0). The gen- 
eral situation is that while "arbitrarily good" performance is not a 


58 

See Wonham for one proof of this fundamentally important result of 
linear system theory which guarantees the existence of a real matrix 
M which will give the matrix (L+MN) an arbitrarily specified set of 
eigenvalues if and only if the pair (L,N) is observable. 
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III 
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Control Law is characterized by (G) . ' CONTROL PROBLEM 


Figure 2-1. The Controller for the Linear Regulator Problem. 
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practical requirement, we cannot be sure that even "acceptable" 
perfctmance will be actually achieved. 

To see how model error in the form of neglected external dis- 
turbances can have the deteriorating effect described above, consider 
the statement of the typical optimization problem (2.1), (2.2). We 
see immediately that the solution is optimal for any initial condition , 
x^ . Thus, if some impulsive disturbance "resets" the initial condl- 

-j 

tions to a new value the system will respond optimally toward y'* = 0* 
However, in the presence of any disturbance which is not impulsive 
(correlated in time) the system will not respond optimally even if 
there are no other errors in the model. In fact, however, each of 
the model errors classified earlier can cause divergence of the 
physical system output.. If one is inclined to blame the control 
theory for this condition then he may search for new ones (such as 
the adaptive control theories) . If one is inclined to blame the 
modeling process for this condition, then he may search for better 
models. This latter attitude is presented in this research. 

A few words are needed here to justify our attention wholely to 

state space models and further, to state space models which are not 

* 

minimal realizations. Our rationale rests solely on the argument 
that physical dynamical systems are not completely state controllable. 
To be more precise we must argue that as we progress up the hierarchy 
of (mathematical) models in Figure 1-1 (which, recall, is arranged in 
order of fidelity of the models) , the expectation of controllability 

yp 

liinimal realizations are state observable, controllable models, see 
Reference 40, p. 94. 
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of the pair (A^ B^) decreases. This notion, is at odds with those, 
offered by Ho and Lee,^^ Kreindler and Sarachik,^^ and Schultz and 
Melsa, Reference 1.6, p. 29, who suggest that uncontrollable models of 
physical systems are "inadvertent." To support our claim to the 
contrary consider the 

PROPOSITION: For every completely controllable linearized model of a 

physical dynamical system, there always exists an uncon- 
trollable linear model of the physical system which more 
accurately portrays the outputs of the physical system. 
For supporting arguments we cite two separate circumstances, the first 
based upon the presence of external disturbances, the second on inter- 
nal disturbances. First consider that for any model of the system 
there can always be identified some disturbance which is "external" 
in the sense that it does not depend upon the state variables pre- 

ic 

viously in the model (i.e., the neglected gravity gradient torques 
on a spacecraft, or thermal gradients on a ground based antenna etc.). 
Now adjoin a differential equation which that disturbance obeys (find- 
ing that equation is quite another task) to the system description. 

The resulting composite system then provides a more faithful charac- 
terization of the Output, y(t), than did the model which neglected the 
disturbance. Moreover, those variables associated with the model of 
the disturbance are uncontrollable. The second circumstance of 

* 

Thus, whether a particular effect qualifies as an external disturb- 
ance depends upon the particular model, » ^t hand. Note that such 

disturbing effects fall in either the category of external disturb- 
ances or truncated modes, as listed on page 12. 
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"Internal disturbances" (truncated modes) is discussed In detail by 

66 

Likins and Ohkaral, In this report It Is shov/n that some modes may 
be found In a linear model of a physical dynamical structure, such as 
a spacecraft, which are uncontrollable. 

Now the danger in using minimal realizations to describe physical 

processes is that while under controlled conditions (i.e. , laboratory 

prototypes) a controllable model may adequately describe the response 

of the output, under "flight" conditions the natural environmental 

disturbances (and truncated modes) may excite uncontrollable modes 

* 

(which are present in the physical system ) . In this circumstance the 
controllable models are loathe to explain the discrepancies observed 
in the measurements and hence even adaptive techniques, designed to 
update parameters within a controllable model, may fail to yield 
stabilizing feedback. Thus, the choice of words "minimal realization" 
to describe controllable, observable models of minimal order is per- 
haps unfortunate since they should not be taken to mean that if a 
model is not controllable there is another, equally credible, model 
that is. 

It is not surprising then, that Astrom^ has found that increasing 
the order of a model can compensate for parameter errors (without the 
use of adaptive techniques). The converse is not true. That is, 
changes in parameters may not compensate for the effects of truncated 


The phrase "physical system" necessarily includes effects which qualify 
as "external disturbances" with respect to a simpler (mathematical) 
model. See footnote on page 17. 
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raodes. This point is discussed in section 3.0. Concerning 
uncontrollable modes then we conclude that they may be (automatically) 
truncated from the model only to the extent that they are also unob- 
servable. 

Finally, we note that classical techniques are restricted to 
controllable (and observable) models and cannot be used to directly 
compensate the effects of uncontrollable modes and external disturb- 
ances. (This is one reason why classical design procedures are often 
characterized in the oversimplified terms "design for stability, then 
check for performance" and optimal control design procedures in the 
terms "design for performance, then check for stability"). Compen- 
sation of external disturbances may be (indirectly) accomplished via 
classical techniques if the right structure for the compensation ±s 
proposed a priori. For example, if a bias disturbance is noticed (in 
the "check for performance" phase of design) , by the appearance of a 
bias error in a pointing control problem, then an integrator can be 
added in the compensation to correct for this disturbance. We will, 
instead, use a procedure which allows consideration of both uncontrol- 
lable modes and the treatment of disturbances directly. 

One criticism which has been leveled at control policies which 
utilize state estimators (see Figure 2-1) is "Why estimate all of the 
state variables when only some might accomplish a satisfactory control 
policy?" Our response to this question is that if we consider our- 
selves to be responsible for both the model and the control law design, 
then, out of concern for the Minimal Controller Problem (presented in 
section 2.0), we take the attitude that if a particular set of state 
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variables does not need to be estimated to accomplish the control 
task then those state (modes) do not belong in the model to be used 
for controller design. 

The overall control design task is often considered, from Figure 
2-1, as three separate problems: 

1. Selection of (A^ , , M^) (including dimensions as well 

as parameter values) as the modeling problem , 

A 

2. Selection of K as the estimator problem , 

3. Selection of G as the control problem . 

(Alternately, items 2 and 3 taken together may be considered as the 
controller problem) . 

In this research we will consider how these three problems might 
be related through an approximation of the model errors which are 
inherent in any finite dimensional model of a physical dynamical 
system. 

A 

It is noted that the matrix K in Figure 2-1 can be chosen accord- 
ing to a number of different criteria. If the problem description is 

A 

deterministic,^ as above, then K is chosen for desirable stability 

properties of the estimator. If the problem description is stochastic 

the K can be fixed in terms of statistical properties of the random 

variables of the problem. Thus as a special case Figure 2-1 also 

describes the Kalman filter which is discussed in the next section. 

2.2 Some Effects of Model Errors in Stochastic Characterizations of 
Dynamical Systems 

In the previous section the point was made that model errors in 
deterministic models can cause state estimators to diverge, resulting 
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in instability of the closed loop system. One approach often used to 
provide a "dynamical cover" for the model errors is to add a noise 
"disturbance" term w(t) to the model and write 
x = Ax+Bu + w(t) 

so that the ensuing mathematics will not lead us to believe that the 
model (of the undisturbed system) is "arbitrarily good." In other 
words some noise (real or imagined) can be added to "discredit" the 
previous model. Now the point to be made in this section is that if 
the actual disturbance acting on the system is a correlated random 
process and we have modeled the disturbance w(t) as a white noise 
process then the resulting state estimator may still diverge (further 
details will follow in section 3.0). Moreover, there may not exist 
any set of parameters within the fixed structure of the model or any 
set of parameters for the covariance matrices of the random variables 
(only first and second order statistics are needed for the optimal 
linear state estimator) which will yield a stable estimator. 

To illustrate this problem let us refer to Figure 2-2 which shows 
the solution of the linear quadratic Gaussian problem with the result- 
ing Kalman filter identified. From the Figure it is seen that the 
model of the physical system is shown with the same notation 
(the superscript j is chopped in Figure 2-2 for clarity) as in the 
previous section with the exception that the initial condition x^, the 
plant disturbance, w(t) , and the measurement noise v(t) are all 

It will be shown in section 4.0 that there exists a "model error 
vector," e(t), which when added to the model (i.e., x=Ax+Bu+ 
e(t)) can correct for model errors due to a) truncated modes (unmod- 
eled dynamics), b) external disturbances, c) parameter errors, and d) 
weak nonlinearities. In this section e(t) is considered to be white 
noise. 
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PHYSICAL SYSTEM 



X + u + w(t) 
z ■ M X + V 

T 

-11 >> 

Control u*-R B'' Kx“Gx 
where K obtains from (2.4), and 

X - x + B^ u° + K(z°- x) 

/s 

where K ■ E R 

T T 

Z - A^Z + Z A^ - ZM^ r“^^Z + Q 
o o 



w(t)~ N(0, Q(t)) 
v(t)~ N(0, R(t)) 

x^ ^N(0, P^) 
o o 

E[w(t)w^(T) ] ■ Q(t) 

E[v(t) v(t)] - R(t) 

E[w(t) X - 0 

o 

E[w(t) v^(t)] - 0 
E[v(t) xj] - 0 
E(x(t)x'^(t)] A Z(t) 


6(t-T) 

6(t-T) 


Figure 2-2. Linear Quadratic Gaussian Problem. 
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Gaussian random vectors with zero means and covariances P • Q(t), and 

o • 

y\ 

R(t) , respectively. Furthermore, v(t) and w(t) are uncorrelated 
(white) processes. Now even though we assume the model is com- 
pletely controllable and observable in the stochastic sense, the 
Kalman filter (state estimator) may still diverge for the same reasons 
as in the deterministic case: model infidelity. Consider the particu- 

lar example of very small, or zero, plant noise Q a 0. Now, the 
estimator gain, K, which was non-uniquely chosen for stability of the 
estimator in the deterministic problem of the previous section, (com- 
pare Figures 2-1 and 2-2) is now uniquely determined in terms of the 

statistical parameters of the random vectors x , w(t) , and v(t) (as 

o 

^ A 

shown in the equations for K in Figure 2-2), and K is called, in this 

A. A 

case, the Kalman gain. Since Q 0, K has a zero steady state solu- 
tion, due to the fact that E = 0 is a steady state solution of the 

A 

covariance equation. The result is filter divergence since for K = 0 
the filter is no longer causally related to the measurements. Thus, 
the filter "thinks" it has "learned" the model and disconnects itself 

A 

from the noisy measurements (by making K ->■ 0) and relies only on pre- 
dictions from the (always erroneous) model . Divergence occurs due 
to the fact that undue confidence is placed on the model (here the 
word "model" includes the statistical parameters Q, R) . 

A 

This example Illustrates how the gain K serves as an "arbitrator" 

to weight between the measurements (which evolve from the real system 

^ ) and the predictions (which evolve from the selected model A. of 
o j 

the real system ^ ). The estimator "decides" which to believe 

o 

(weight more heavily) based upon model parameters presumed (Q(t), R(t) , 
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E , Thus, when the model itj inadequate, the "decisions" made 

o 

by the filter can be bad, leading to system instability. This example 

also has led many researchers to seek a "fix" for this divergence prob- 

^ 18-20* 

lem by increasing Q in some way, thereby artificially, or other- 

wise, decreasing the confidence in the model by causing K to be 
increased. Detailed conditions of Kalman filter divergence may be 
found in References 21 through 24. 

21 

Conversely, the theorem of Price gives the conditions for uni- 
form asymptotic stability in the large. Under stated conditions the 
theorem assures that the norm of the coyari,ance matrix has an exponen- 
tial upper bound 

||e[(x^ - x^)(x^ - x^)'^]|| < e ^ 


•1 ' 

where x** is the state of a better model of the same dimension (i.e. , 
more correct parameters) and x"^ is the state of the model upon 
which the filter design is based. The unsettling feature of this 
result is that it is still possible for the filter to satisfy these 
conditions (and thus be uniformly asymptotically stable in the large) 
and yet, within time constants of the system , yield arbitrarily bad 
estimates. This can occur because no upper bound is established for 
and o^. 


In a similar manner, persistent unmodeled "disturbances" (which 
may result from truncated modes as well as external disturbances) do 
the same "damage" to a deterministic (Luenberger) observer as unmodeled 
* 22 

Fitzgerald, shows that an increase in Q can also make the estimates 
worse. 
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correlated disturbances do to the Kalman filter. We now refer to some 
specific alternatives and past attempts to compensate for the presence 
of model errors in the control design problem. 
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3.0 Past Approaches to Compensation for Model Errors 

What follows in this section is a brief summary of various methods 
of control design in the presence of modeling errors of the type dis- 
cussed in section 2.0. A point of view one can adopt to help unify 
these approaches is to consider that the basic objective common to all 
methods to be discussed is to somehow "discredit" the mathematical 
model. After all, discrediting an erroneous model is a first step to 
better control decisions. 

Basically, the problem is as follows: We have committed the 

physical system to a linear model, gi 


= A^x^ + B^u° 
y^ = C^x^ 


(3.1) 


where z^ represents measurements (as predicted by the model, 
where u° represents the "decision," or "control," variables selected 
for manipulation, and represents (as well as this model is able to 
predict) the output variables, the object of our control. We wish the 
variables y^(t) to behave in a certain way in accordance with some 
mathematical criteria we have specified to reflect the desired per- 
formance of the physical system. Now, the selection of u°(t) is to be 
based upon analysis of the model (to avoid trial and error methods 

with the physical system) . On the basis of this "analysis" (the pre- 
diction of physical system performance on the basis of the model, ^^) 
we can provide u°(t) in an open loop fashion. 
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closed loop fashion (if some measurements z°(t) are available), 



and modifications of the closed loop fashions which include the 



This latter scheme falls under the category of "adaptive" control. 
Now let us begin our brief survey. 
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3 , 1 Feedback Control 


If the model was a perfectly accurate representation of the 
physical system then the physical output y°(t) would behave in exact 
agreement with the output y^(t) as predicted on the basis of analysis 
with the model ^ j • this event, one can issue the controls u°(t) 

to the physical system in an open loop manner with confidence that the 
physical output y°(t) will behave exactly as predicted. However, the 
effects of nonlinearities, parameter errors, and external disturbances, 
and modes of the dynamical system which have not been modeled (the 
collection of these effects has been labeled in section 2.0 as ''model- 
ing errors") cause physical systems to deviate from their modeled 
behavior. It is primarily for this reason that feedback control has 
been so useful. Even though the selection of gains or dynamical ele- 
ments in the feedback path is still based upon analysis with model . , 
the current measurements from the physical system serve to automati- 
cally "update" the initial conditions presumed for the model. The 
open loop command, on the other hand, is based upon prediction of 
system performance for all future time from the fixed initial condi- 


tions presumed for the system. Thus, feedback has the effect of 
shortening the time over which the model needs to provide accurate 
predictions. 


3.2 Conservative Designs 

To partially compensate for uncertainty in the model a "safety 
margin" can be incorporated into the performance specifications. 
Uncertainty in the disturbance environment may lead one to use "worst 
case" disturbances for control system evaluation purposes. Similarly, 
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*’minimax" designs have been used, to obtain the minimum of a 
performance criterion subject to those particular parameter changes 
which tend to maximize the criteria. 

3.3 Parameter Sensitivity Approaches 

0/1 O / 

The attitude taken in sensitivity approaches, is that if the 

values of the parameters of the model are uncertain then one should 
add sensitivity measures to the performance criterion to be minimized. 
Sensitivity of the original performance criterion or sensitivity of 
the state trajectories may be considered. In the latter case the new 
optimization problem is: 

Minimize 

'o 


J(u) =f (x"^Qx + s^Ss + u'^''Ru)dt 
Jr\ 

X = A(p)x + B(p)u 


(3.2a> 


subject to 


^ X + A(p) .1+ ^ u+B(p) 


8p, 


3Pl 
(3.2b) 


.r 3A(p) . . / s r, 9B(p) ^ 9u 

S * It + A(p) S + -g^ U+S(p) ^ 


, i . 9x 

where s A 


= Bp, 


Bs 

Bp 


= 0 (assume) 


u = X + ® 

and Q , S , R are positive definite matrices. 
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Newmann, shows that this problem generally has no solution and 
33 

Quardabassi, proves why; the system is uncontrollable for a suffi- 
ciently large number of parameters (r) . Some results can be obtained 

if r is small and the requirements of positive definiteness of S 

34 

are relaxed. Sakharov suggests that, in general, it is necessary 
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to use first and second order sensitivity models. However, if the 

trajectory x(t) is itself erroneous because of model errors then 

differentiation of x(t) (with respect to p) provides variables, 

3x 

s = , which seem to provide even less reliable information about the 

system. Differentiating x twice (to obtain second order sensitivity) 
seems even more hazardous. Also note that the state vector is doubled 
in size for each parameter considered uncertarln in the original model. 

3.4 Singular Perturbation Approaches 

The sensitivity methods above increase the order of the system 

under study. That subsystem (3.2b) which was augmented to the model 

(increasing the order) was based solely upon information contained in 

the smaller model, jt^y An alternate approach is to consider that 

every .model of a physical system can be considered a truncation from a 

better model. In this view it is possible to increase the order of 

by adding to those modes which were truncated to obtain 

originally. Now the optimal control solution for this higher order 

model can be approximated by the application of singular perturbation 
35 

methods. The method applied to the linear regulator problem proceeds 
as follows, see References 7 through 9. Presume that the equations 
can be put in the form 


X 


.t 

ex 


^ 11 ' 

^2i 


*12 

^ 2 * 


“I . 

•+ 

B^(e) 

1 t 
[ X 


BjCe) 


u 


(3.3) 


where e is a small parameter and where x is an n-vector of variables. 
Now X*" is an N-vector (N might be large) of variables we would like to 
truncate from our model, but because the truncated model 
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X * X + u, let 


A A A^^(O) - A^2<0)A22“^(0)A2j^(0) 
B 4b^(0) - ^2^°^ 


(3.4) 


might yield a controlled system which is sensitive to the neglected 
effects of X , we Instead wish to find an approximate solution of the 
control problem using the higher order model (3.3). The exact solution 
for 

T 

min J(u) = I (x"^ Qx + u^Ru)dt + x(T)H x(T) (3.5) 

u •'0 

T t 

subject to (3.3) and x(0) = x i x A (x, x ) ; Q,F, positive semi- 


definite and R positive definite, is 
-1 T 

u = - R B K X 


(3.6) 


where K obtains from 


K»-KA-A^K + KBR"^B^K-Q K(T) = H (3.7) 


and 


A = 


Aj^l(e) 


A^iCe) 




A22(e) 


e J 


B = 


B^(e) 


B2(e) 

L e 


(3.8) 


0 


OJ 

\ 


H = 


H, 


th 


Thus a Riccati equation for the (n+N) order system must be solved. 

Now, in Reference 9 the approximation suggested is that the control 

-1 T 

gain matrix G = - R B K be expanded in a MacLaurin series in e 
and the first two terms be kept. 
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This result leads to a requirement to solve (N(N+l)/2 + n(n+l)/2) 
linear scalar equations for G (and inverting an NxN matrix) instead of 
the (n+N) (n+N-l)/2 nonlinear scalar equations required in the exact 
high order solution. 

The method proposed in Reference 8 provides an alternate approxi- 
mation by solving a boundary layer problem using two time scales, t 
t— T 

and T = . The Riccati matrix K is then approximated by "slow" 

and "fast" solutions 

K(t,e) = K(t,e) + K(x,e) 

where K(t,e) is a power series in e and is a function of the slow 

variable, t, and where K(T,e) is a power series in e and is a function 

of the fast variable, x . The solution is then obtained by matched 

35 

asymptotic expansions. The number of scalar equations to be solved 
is the same as in the above case. 

This solution of the problem handles higher order effects (from 
modes that might be truncated in a low order solution) very well. It 
is still susceptible to external disturbances and parameter errors. 

The Implementation of the solution (to form the controller) is of 
the higher order. 

3.5 Parameter Estimation Techniques 

Suppose we have determined that the zero state response of a 

physical system can be reliably modeled by the completely controllable, 

observable linear model of order n.. Due to the fact that param- 

J J 

eter changes, which are expected in the system from the aging of 


G(e) = G(0) + 


[S <“-] 
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components, the changing of the environment, etc., cannot be reliably 
predicted, we may wish to provide a means to update the parameters 
presumed within the fixed structure of the model 4 ^-^ • There are on- 
line and off-line techniques to accomplish this. Reference 1, and we 
mention here only one basic scheme. 

If we label the uncertain parameters, Pjl>P 2 Pj.» then state 

estimation of the augmented system 

= A^(p)x^ + B^(p)u° 

p = 0 (3.9) 

. mV 

in response to real measurements z°(t) may provide better values for 
p. This problem is difficult because it is now a nonlinear problem. 

The method also cannot account for other types of model errors such 
as external disturbances and truncated modes. Astrom^ has shown that 
increasing the order of the (linear) model can compensate for param- 
eter errors, without adaptive and nonlinear methods. Another way to 
increase the system order is discussed next. 

3.6 External Disturbance Point of View 

Instead of concentrating on errors in the parameters of model , 
suppose we consider the effect of an external disturbance w(t) on 

x^ = A^ x^ + u° + w(t) (3.10) 

3.6.1 Deterministic Disturbance Characterizations 

If we can construct a differential equation which w(t) is known 
to obey 
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w = Pz 


(3.11) 


z = Dz 

then we can augment this disturbance system to and obtain the new 
system 



(3.12) 


2 3 

which was studied by Johnson and Johnson and Skelton, yielding an 

approach which came to be known as "Disturbance accommodating Con- 

4 6 5 

trollers," Johnson. Davison, Bhattacharyya and Pearson also studied 

this problem. In fact, it was the writing of a differential equation 

which an input (command or disturbance) obeys that led to the solution 

36 

of the servomechanism problem in Bhattacharyya and Pearson. The 
"tracking" of a time varying command or the "rejection" of a time vary- 
ing disturbance both reduce to a linear regulator problem once the dif- 
ferential equations that each obey are adjoined to the model equations. 
When command inputs and disturbance inputs were considered simply as 
functions of time, solutions of the servomechanism problem and the 
problem of accommodating disturbances were both termed "unrealizable" 
owing to their dependence on future values of these inputs. Reference 
37, page 798. 

The above approach presumes that the disturbances do not directly 
contribute to the measurements. This limits its use in the accommo- 
dation of disturbing effects which are not independent of the state 
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variables already in the model, such as truncated modes (which can 
affect the measurements directly) . 

3.6.2 Stochastic Disturbance Characterizations 

In the absence of better data we might simply describe the 
disturbance w(t) in (3.10) as white noise and take a stochastic 
approach to the problem* Stochastic approaches strive for good 
(optimal) performance in an average sense. A particular experiment 
might fail to meet requirements even without model errors. The 
expense of the experiment iJlaces a proportionate degree of responsi- 
bility on the control design(er) * Butj obtaining reliable statistical 
data for the system is often difficult. This fact, together with the 
fact that the theory for linear Gauss-Markov models is so well devel- 
oped (and with a lot Of encouragement from the central limit theorem, 
Reference 38, p* 96)* leads to widespread use of the linear quadratic 
Gaussian (LQG) approach (see Reference 39 for an entire journal issue 
devoted to this subject)* Three problems of the LQG approach which 
are of cdncern in the present research are: 1) Pursuing "long term” 

goals (the probabilistic approach) can lead to dire Consequences in 
the "short term." (Successive short term "failures*' can lead to 
system instability and Kalman filter divergence (denying long term 
goals) ; 2) The assumption of Gaussian distributions can be restrictive 
3) The assumption of white noise can cause problems (section 2.2 
discussed some effects of correlated disturbances which are modeled 
as white) . 
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3.7 Adaptive Approaches 


The basic question motivating most adaptive approaches is "Are 
there better values for the parameters (within the fixed model 
structure assumed) than those which were chosen a priori?" This 
question applies equally to deterministic and stochastic models. In 
Figure 2-1 and previous discussions we referred to parameters 
(A'^ , C^, H^) , K , and G. It is these parameters that are avail- 

able for adjustment in either the deterministic or stochastic case, 
although in the stochastic case K is usually changed via changes in Q 

/*s A 

through the K(Q) relation shown in Figure 2-2. The various algorithms 
employed to change the parameters will not be discussed here, but we 
are interested in the question "Does there exist any set of parameters, 
within the model structure assumed, which will stabilize (or make 
acceptable) the controlled system?". 

3.7.1 Deterministic Parameter Identification 

In deterministic problems the model usually used for parameter 
adaptations is a "minimal realization" Reference 40, p.94, (a control- 
lable, observable model) . The limitation faced by such a scheme 
relates to the following arguments: 

a. It is reasoned in this report that physical dynamical 
systems are not completely controllable. (Supporting arguments will 
be offered) . 

b. Input/output relationships obtained in a controlled experi- 
ment (i.e., all initial conditions are zero) can lead one to select 
"minimal realizations" to describe the system. However, in actual 
operation of the system in its natural environment the uncontrollable 
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modes (which are always present) can be excited and there may not exist 
any set of parameters for the minimal realization which will yield 
acceptable closed loop operation. In the model reference adaptive 
control schemes, the objective is to force the physical system (a very 
large order "model") to behave as a low order system. Parameter 
adjustments are sometimes used to find the best low order approxima- 
tion to the (high order) physical system, and usually complete control- 
lability is presumed, as noted above. 

3.7.2 Adaptive Kalman Filtering 

Among the approaches to prevent Kalman filter divergence the 
most common theme is to prevent the Kalman gain from reaching its 
steady state value (as determined by the model, see Figure 2-2, 
thusly, "instructing" the filter to discredit the model. One way to 
do this is to artificially, or otherwise, increase the intensity of 
the plant noise (increase the covariance matrix Q in Figure 2-2, 
Reference 17-20). Most Q selection procedures are based upon; 

• comparison of predicted with actual statistics of the measure- 
ment residuals. References 17 and 18. 

• a priori experimentation, Reference 19, 

• sensitivity approaches, References 41, 42, and 43. 

/\ 

Increasing Q can sometimes partially compensate for unmodeled corre- 
lated disturbances, although estimates of states of a stable system 

A 

may also get worse with increasing Q, Refarauce 22. Steady state gain 
prevention may also be accomplished by measurement "aging" (viewing 
the older measurements with less confidence) . Exponential aging is 
proposed in Reference 44, and linear aging is used in Reference 45. 
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Jazwin,ski and Crump use a limited memory filter to serve the 
same purpose. 

Finally, We are left with a basic requirement for the success- 
ful application of these adaptive techniques; that the model assumed 

II * 

possesses an appropriate structure. Thus, errors in structure 
(that is, the order of the assumed model is too low) can be more 
critical than errors in parameters because the former errors may not 
be compensated even by adaptive techniques. Likewise, in the LQG 
problem, when correlated disturbances have been modeled as white noise 
there may not exist any values of the system model and covariance 

. A 

parameters (A,B,C,M,Q,R,P^) which will yield a stable system even with 
adaptive techniques, Reference 22. Correlated disturbances (either 
w(t) or v(t) in Figure 2-2) may, of course, be modeled by a Gauss- 
Markov process. Reference 18, adding order to the system model, but 
it is not clear how to select such a model. Astrom^ has also pointed 
out that increasing the order of the model can compensate for param- 
eter errors, and we have above discussed the fact that the converse is 
not true (changing parameter values may not compensate for errors in 
model order) . Bryson and Henrikson^^ and Stubberud and Stear^^ 
accommodate correlated disturbances in the measurement process ( v(t) 
is correlated in Figure 2-2), by differentiating (in continuous models) 
or differencing (in discrete models) the measurements to make new 
"measurements” white. Differentiation can create practical problems 

■k 

We must keep in mind the fact that there are no well developed 
theories to help one determine what is an "appropriate" model for the 
particular control task at hand. 
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because of noise amplification. A more serious handicap with this 
approach, however, is due to the fact that the system is more sensi- 
tive to disturbance modeling errors than is the augmentation approach 
using Gauss-Markov models. This notion will be made more precise in 
the sequel. 
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A.O Control by Model Error Estimation 


In the discussion of the last section it was pointed out that the 
structure (order) of the model was very important to the success of 
the control objectives. Several of the schemes discussed were con- 
cerned directly with changing the model order and we will pause briefly 
to show the relationship of these methods with the one to be proposed 
in this section. 

Refer to Figure 1-1 where the proposal was made that models be 
constructed in two steps. The first step ^ model 

reduction (or in the language of the dynamicist, mode truncation) 
which left us with a model which might be quite sensitive to the 
model errors associated with truncated modes, neglected external dis- 
turbances, and parameter errors. The second step of the proposed 
modeling plan of Figure 1-1 requires the expansion of the model (but 
not up to the order of with the hope that the added modes , whose 
description is to be accomplished in this section, would lend to the 
composite system a degree of model error insensitivity. Now to be 
more precise. The singular perturbation approaches of Kokotovic^ and 
others, provide a way to obtain the approximate solution for the higher 
order system controller based upon considerations of a lower order 
model. (Specifically, the method yields, for the linear regulator 
problem, an approximation of the larger dimensional, n^ x Riccati 
matrix based upon asymptotic expansions of the Riccati matrix in terms 
of a small parameter C. When C = 0 the state equations reduce from 
order n^^ to order n^.) This then, provides one method to obtain an 

That is, two steps after the construction of the conservative model 
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model; consider the model from which is a truncation and 

implement an approximation of the higher order controller. Care must 
be taken to restrict the model to an order which can be implemented 
(in the state estimator of the controller) ; usually this restriction 
requires a low order for controllers useful for spacecraft, for 
Instance, The other methods to be discussed, including the one out- 
lined in this research, begin with the smaller model although the 
presumption is made that represents a truncation of a higher order 
model, (even though we might never actually construct and is 
attenvdad with the associated model errors. 

The sensitivity methods of Cruz, Perkins^ and others provide 
another way to construct a modelj ^2 from the model ^nd that is to 
augment to the parameter sensitivity subsystem (the s = f(p,x,u,s) 
equations in (3.2b)). This sensitivity subsystem is generally uncohtol- 
lable, as is the "model error system"' to be proposed in this section. 
However, a key difference between the methods is that the sensitivity 
variables s(t) are desired to be minimized (to be penalized in the cost 
functional of (3.1)) and their uncontrollability therefore creates 
difficulties. Hie variables of the "error systems" described below do 
not pppear in the cost functional and are only added to the system to 
make the estimates of the state variables (of^ei^) more reliable. 
Therefore their un controllability creates fewer problems. 

One further disadvantage of the sensitivity method of constructing 
fs to be noted. This approach expands the model from^t^^ tox ^2 
based solely upon information contained inj^^, the truncated model. 

As mentioned in Section 3.0, this disadvantage is not shared by the 
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singular perturbation methods or the method proposed in this report, 

both of which "consult";*/^ before constructing 4 ^ 2 * (Actually , 4 ^^ ^“^2 

in the singular perturbation method.) 

There are numerous papers which concern themselves with feedback 

through dynamical compensation, and as such can be considered as 

methods to obtain an augmented system ^ 2 * Johnson and Johnson and 
3 

Skelton provide a controller which is based upon the construction of 

an observer to estimate the states of the equations that the external 

disturbance is assumed to obey. Thus, in that paper jdl 2 obtained by 

augmenting to the equations of the external disturbance. Johnson 

labels such controllers "disturbance accommodating". Davison also 

accommodates external disturbances and command inputs which satisfy the 

49 

(same) given differential equations. Bhattacharyya and Pearson, 
include both equations for external disturbance and input command 
signals and convert the resulting servomechanism problem (via the 
transformation to an "error system") into a linear regulation problem 
which is then solved. Algebraic conditions are given for the existence 
of a stabilizing solution assuming availability of all states of the 
composite system (i.e. perfect state estimation). Some sufficiency 
conditions are also established for the existence of a stabilizing out- 
put feedback controller. A conclusion of Reference 49 is that the 
model (which is now the composite of the disturbance equations, 
and the input command equations) is not always stabilizable by/state 

Bhattacharyya and Pearson's "error systems" describes quantities which 
the control designer wishes to drive to zero. The model "error 
system" of this report, which x^7ill shortly be defined, is not the same. 
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feedback. The present study begins with the condition of 
uncontrollable and even unstabilizable models argues that a 
practical concern is the stabilization of a model different from ^ 2 * 

In the sequel the model is used only for determining the desired 
structure of the dynamical controller (control law plus state estiina- 
tor). Also the papers described above say Very little about how one 
should go about constructing the models which are augmented to jd'y It 
is a primary concern of this report to provide a procedure for 
determining the parameters of the model ;^ 2 » given the model This 

emphasis makes this study primarily one of modeling methods rather 
than one of new control theory development, as noted previously; how- 
ever the inherent interdependence of the modeling and control problems 
obscures this distinction. Now, let us proceed tp develop the "model 
error system" which we will augment to to form/i!^. Firstly we must 
provide suitable definitions of the model error. 

4.1 Model Error Definitions 

In this research the letter z always denotes the measurement 


vector, and the letter y always denotes the output quantities which 


are desired to be regulated by control. In other words, for the 
optimal control problems we will discuss, y' appears as those .variables 
associated with model jd-. which we penalize in the performance functional 


1 

J[u®] = I y^(T)'^H y^(T) + -| f (y^^Q y^ 

•'o 


+ u®"^R u°) dt 


yJ C 


0 < k < n. 

3 
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where Q, R are always positive definite, and H is positive semidefinite. 
The variables z*^(t) i = 1 - Jl are those variables which are actually 
measured (as nearly as the model can predict). The vector y^CR^ 
where 0 s k s n . . Thus, one can sometimes choose k = n. if, he wishes 

j 2 

to obtain that extra quality of smcothness in optinial'' trajectories 
x(t) afforded by the least squares theory when all of the iv states of 
model are penalized. The superscript j means that the variable is 
associated with the model . The notation u°(t) denotes the control 
actually input to the physical system. The notation z°(t) refers to 
that time record of measurements actually obtained from the physical 


system. 


Consider the model ji. 


+ B^u° 

[‘"'I 


x^ C R ^ 
u C R 
yi C R^^ 

z^ C R^ 


(4.1) 


and the model 


•X .X X 1 O X 

X = Ax xBu + w 


/ \ r ni 1 i 

1.1= C X 

kJ 


n , 

x^ C R "• 
y° C r"’ 
y C R 
z C R 


(4.2) 


Suppose > n^ and consider a particular similarity transformation on 
.*sl. such that 

X 




T ^A^T = 


^ \2 

^21 ^22 


(4.3) 

(4.4) 
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T~^B^ = 


i/t = [M^O] (4.6) 

G^T = [C^O] ' (4.7) 

where all matrices without the j superscript are arbitrary. Note that 
by counting the constraint equations, 


(4.5) 



II 

(n^ eqs.) 

(4.8) 


= 

(n^m eqs. ) 

(4.9) 

M^T 

- IM^OI 

(n^ii eqs.) 

(4.10) 

cS 

= tC^O] 

(n^k eqs.) 

(4.11) 

t”^t 

= I 

(-|- n^(n^+l) eqs.) 

(4.12) 


A necessary condition for the existence of such n^ unknown of T 
is obtained , requiring the model to be sufficiently larger than the 


model jdtj such that 


^i - 


2Ck+£)-H 


/ 8 n . Cn .+m) 

1 + / 1 + — ■ 

>/ (2rk+£)+lV 


(4.13) 


V C2Ck+£)+l)^J 

It is helpful in gaining insight in what follows to assume that such a 
T exists although the main results do not depend upon it. In fact, 
such a T need not ever be computed to utilize the procedures suggested 
in the summary of Section 5.0. Note also that if the constraints on T 
are relaxed by ignoring (4.11)* then, (4.13) is relaxed to 

9P4.1 r / 8 h,Cn,-hn) 1 .. .. 


n 

i - 2 


1 + /I + 


L V (2JI+1) 

^ 2 

The requirement n^ ^ (2(k+Jl)+l)n^ + 2 nj(n^+ m) yields at least as 
many unknowns as constraint equations. 
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and by ignoring, instead, the constraint (4,10), (4.13) is relaxed to 

(4.15) 


2k+l 

^i ^ [ 


/ 8n.Cn,+m) 

1 + /l + -J J . 


w (2k+l)‘ J 

Prompted by these observations we offer the following definitions. 


Definition I: If there exists a vector, e"^ (t,x ,x'^) such that 

’ z * o* o 


^^^t,xSxj; 
z * o* o 

z^(t) = z^(t), where z^(t) obeys (4.2) and z^(t) obeys 

x^ = A?yp + B^u° + e'^^(t) 

z 

= M^x^ 


(4. 16) 


then such a vector is called a "model error vector with respect to 


measurement . 


i i i i 

Definition II; If there exists a vector, e*' (t,x .x"*) . such that 

y o’ o 

iL *1 X *1 

y (t) = y (t) , where y (t) evolves from (4.2) and y'*(t) evolves from 


the system 


x^ = A^x^ + B'*u'^ + e^'^(t) 
y^ = C'^x^ 


3 0 




(4.17) 


then such a vector is called a "model error vector with respect to 
output". 

i i X i 

Definition III: If there exists a vector e'’ (t,x , x ) such that 


/y^t)\ _ hht)\ 
\z^(t)/ \z^(t)/ 


(4.18) 


where 


/ y^(t)\ 


evolves from (4.2) and 


/y^(t)\ 


evolves from 


x^ = A'^x^ + B^u° + e^^(t) 



(4.19) 
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then such a vector is called a "model error vector with respect to out- 
put and measurement". (In the sequel it is this model error vector, 
which finds the most frequent use and it will be convenient to 
shorten the name to "model error vector".) 

The first observation one can make concerning these definitions is 
that if no restrictions are placed upon the functions of time, e (t) , 
e^^(t) , e^^(t) , then all such model error vectors defined above exist, 
assuming observability of the model both in output and measurement 
(the pairs (A^ ,M^) and (A^,C^) are observable). To show this, consider 
only the definition of e^^(t). The output of differenced from the 
output of , is given by 


^^(t) - y^(t) = C^$^(t,o)x^ - G^$^(t,o)x^ 


+ 


t ( ) 

J c"^<5^(t,a)(BV + wS - C^$^(t,a)(B^u° + e^^)| 


do 


(4.20) 


The requirement y^(t) = y^(t) is satisfied by the choice 

e^^(a) = [C^$^(t,0)]^ |[C^$^(t,o)x^ - C^$^(t,o)x^]5(t-a) 

+ c^$^(t,a)w^ + [C^$^(t,0)B^ - c^$^(t,a)B^ ju°(a)| (4.21) 


where S(t-a) is the Dirac delta function and T denotes the generalized 
psuedo— inverse of Penrose. The matrices $ and denote the state 


*That is, if the functions are allowed to contain impulses, doublets, 
etc. 

**Note that (t,a)e^^(a) = 0 if and only if e^^(a) E 0 under the 

assumption (actually the definition) of observability of the model 
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1 i 

transition matrices for A and , respectively. Note also that the 
above definitions of model error are relative to two models (of the 
physical system) and thus have nothing to do with the physical systems 
itself. 

i i i i 

Now let us view the error vector, o'* (t,x'^,x.), from another 

o’ j ’ 

perspective. Consider the transformed system in (4.2)-(4.7) and see 
that the vector which can be added to the plant equations of model 
to make its outputs identically equal to those of the larger (and 
presumed better) model is given by the "error system" 


e^^(x'^ ,x^,t) = At„x^ + <^-w^ 
o o 1 


~ ^22^^^ A 2 j^x^+ B^u°+ 


I 

A -j"! , 




B^ 


A 


(4.22) 


where the transformed n^-vector, J , in (4.3) has been partitioned with 


the designation 


■c.) 


n . 

J n J 


X-" c R 


x" c R 


n .-n . 

1 1 


(4.23) 


To see this simply write the model in (4.2) in the transformed 

coordinates (4.3)-(4.7) to get (writing the separate equations for the 

t. 


partitioned parts x^ and x^"). 

I P = A^x^ + B^u° + e^^(t) 


( 




\ 


x^ = ^ 22 ^ ^21^^^ B^u°+<§’2W*| 

e^^(t) = \2^ 


(4.24a) 


(4.24b) 
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and compare (4.24a) with (4.19) and (4.24b) with (4.22). We have 
presumed also that the initial conditions, associated with in 

(4.1) have also been applied to the n^-vector in (4.24a). 

If we wish to make reference to models which approximate the 
physical system (in an arbitrarily small region around the null solu- 
tion of the (linear) model equations) well enough to serve as an 
evaluation model then such models will necessarily be of high order 
so that the condition (4.13) is satisfied and the interpretation of the 
model error vector in (4.19) is useful. Use of the transformation 
defined in (4.3) clearly focuses the fact that, under the conditions 
discussed above, there exists a model error vector which can be added 
to any linear model which accommodates the effects of neglected 
external disturbances, w(t) , truncated modes, corresponding to the 
truncated states x^(t) and parameter errors. To appreciate the 

inclusion of parameter errors in this statement, note that only the 

i i 

parameters of the transformation T would change to give Ar + AA“^ , 

+ AB^ , on the right hand side of (4.4)-(4.7), changing the second 
equation of (4.24b) to 

e^^(t) = AA^x^ + AB^u° + ^22^^ (4.25) 

Parameter errors in and are accommodated in the output and 
measurement residuals to be defined in x^hat follows. 

An "evaluation” model is one which represents the physical system well 
enough to be used to evaluate candidate control schemes via analysis 
or simulation by computer in lieu of physical system testing. 
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Any approximations of the model error vector which we will 
subsequently use will have residual errors associated with them, 
requiring still further definitions* Suppose an approximation of e (t) 
is written e^^(t) (when, in the sequel, it is clear which two models 
A,. and*rf. are being referenced Che j.i euperscrlpte will be oeitted) , 
then define a residual model error 

_ gji (4.26) 

Now the relative residual errors in the outputs and measurements are 
defined as a consequence of the residual in the model error and are 
defined by 






f $^(t,a) i'^^(a)da 

j •'o 


Now it follows that 




Thus the relative output and measurement residuals are defined by 

i X i X 

(4.27) if one, in turn, recalls the definition of e-^ and e . 
Equivalently, and more conveniently, one can take (4.28) as the 

X i 

definition of the relative output and measurement residuals, y , . 

Finally, the measurements evolve from the physical system and not from 
the model of the physical system. Define any difference by 


-X \ to X 

y \ ^/ y - y 

/ \ z° - z^ 


Q ^X X 

y - C X 


z - M X 
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The error relative to the physical system (called the output or 
measurement residual) and the etrot relative to two models (called the 
relative output or measurement residual) can be related by, using 


(4.28)* (4.29) 





^jij 


or by the ’’chain rule’’ 



(4.30) 


It should be noted that if the models of this section were 
restricted to the unforced case (w‘(t) = 0 in (4.2), and e^^llt) = 0 
in (4.19)) then the model matching results of Reference 50 can be 
applied* to wit; 

Lemma 1; (Moore and Silverman^*^ * p» 492) 

Refer to models in (4.1) and 4 ^, in (4.2). Let 
J 1 


m ^ k 


and define 


A A 1 
0 = rank 

o 


6^ = rank 
o 


6^ 

P 


[(A^ (cV)"" (cV 

. X r • . . .(n.-l) .1 

^ = rank ^ B^ J 

] 


6^ = rank (C^) * (C^A^) *, 

o L 


Then the model is ’’zero state equivalent in the output” to^^ 
if and only if 
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(4.31) 


A *1’ i 

C^(aJ) = C (A ) 


for 


a == 0,1,... V where 

\&^ + 6 ^- 1 , + 6^-1 


V = max 


A corresporiding result of "zero state equivalence in the measurements" 
is available by replacing by everywhere in the above lemma. 

"Zero state equivalence in the output (measurement)" means y^(t) S 
y^(t) , (or, z^(t) H z^(t)), if the initial conditions, x^(o) and x^(o) , 
are all zero. In the physical system it is not usually possible to 
keep the initial conditions (which correspond to all the variables Xj^, 
k=l, zero, especially for large n^. For instance, in spacecraft 

the structural modes of vibration are unavoidably excited by noisy 
actuators (driven by noisy sensor measurements) , and other disturb- 
ances. Moreover, some of these modes (there are actually an infinite 
number of them in the physical structure) will correspond to those 
truncated in the reduced model .cT. Therefore, input/output relation- 
ships of model cannot account for x^(o) ^ 0 (see (4.23)). Also, 
input/output relationships cannot account for uncontrollable states 
which are reasoned to be present in all physical dynamical systems. It 
is for these reasons that input/output procedures will not be pursued 
further here. 

In the following section we wish to consider a description of the 
synthetic modes assumed for the error system. 

4.2 Selection of the Synthetic Modes for the Model Error System 

Recall that the exact control problem which we would like to 
solve, neglecting irapleraontation constraints, is the minimization of 
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J(u°) = 


(4.32) 


X 

f + u°^R u°)dt + -- y^(T)’^H 


(T) 


subject to the very large model 


.1 .1 1 , ^1 o , 1 

X = Ax + Bu + w 



M 


1 - 

X C R 

1- „k 

y C R 


\ 1 1/- 
X (o) = X z C R 
o 


(4.33) 


where the coordinates of (4223)* (4.24) are presumed. Now (4.32), 
(4»33) can be considered the exact variational problem which we can 
only solve by approximate solution methods, owing to its large dimen- 
sion. Alternately, this problem can be replaced by an approximate 
problem which we can solve exactly . 

The approximate problem we might solve is as follows: Consider 

the decomposition of (4.33) as given in (4.3)-(4.7), (4.24) but instead 

of utilizing the exact error system (4.24b) to generate the vector 
3X 3X ^ 

e'^'^(t) j, "curve fit" e "^(t) with a set of d approximating functions 


f (t) , i = 1,2,. . .d. 


d 

e^^(t) = E p^f.(t) = P°f 
i=l 


e^^ C R ^ 


f C R 


(4.34) 


We must pay particular attention to the initial conditions used in 

31 

(4.33) which are used to generate the particular e (t) we wish to 
approximate, as discussed in the next section. 


The smaller model is labeled larger model in this discus- 
sion is A see Figure 1-1. Hence, through this section e^^ becomes 

31 ^ 

e . 
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4.2,1 Model Error Compensation by Open Loop Signals 

In this section (4,2,1) we discuss a technique of model error 

compensation which we will not later use and this section only provides 

a basis for comparison with the method (in the following section, 4.2.2) 

which we do intend to incorporate into our overall design scheme. 

For a given e (t) and a given set of functions, f^(t) , the 

coefficients P of (4.34) can be chosen to minimize the least squares of 

the error, e (t)-e (t)=e. 

T 


X 


J(P^) = I e(t) g(t) e(t)dt 


or the least squares of error plus error rate 

T / \ 


(4.35) 


J(F°) = f (e^ e^)Q^(t) j dt (4.36) 

where g(t) is a weighting scalar, Q^(t) is a positive definite matrix 
V t C (A,T) and where T is the entire interval of the control problem, 
(see (4.32)). A shorter interval for consideration in (4.35) will be 
discussed in Section 4.3. 

The use of least squares methods must be carefully considered so 
as not to allow large instantaneous errors (4.36) (and even sustained 
oscillations of large magnitude). The criteria will enforce a certain 


: 31 

The presumption of the availability of e (t) has a serious conse- 
quence. Nothing has been said so far which relieves us of the duty 
to actually construct the larger model select a truncation (a 

31 

reduced model) and generate e according to (4.24b). Some relief 
from the pres^^of this requirement is found in the next section. For 
a guess for e (t) , one could "assume some modes" which are believed 
to be present in (4.24b), such as, for example, some structural modes 
of vibration which have not been included in the truncated model-^jf^- 
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smoothing quality by simultaneously keeping error rates and errors 
small. Alternately, the approximating functions, f^(t), of (4.34) may 
themselves be selected so that even (4,35) yields certain smoothing 
qualities, and this is a subject of Section 5,0. For the present, we 
will consider (4.35) for the determination of P, noting that (4.36) may 
he a wiser choice for arbitrary selections of approximating functions, 
f^(t). The P which minimizes (4.35) is given by (see Appendix A) 



Therefore, one approximation of the original problem (4.32), (4.33) is: 




where P° is specified by (4.37). 

The solution of this type of control problem (with the forced 
plant equation this was originally called the servomechanism problem) 
can be found in a number of places. For constancy of notation see p, 15 
of Reference 3. 


g 31 

When an error vector, e"’ , has been added to the (truncated) model 
the new model will be labeled even though some parameters of 
are retained from-el^j see (4.38). 
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T 

u° “ “ r"^ (Kx^ + h(t)) 
where h, K satisfy 

rp /m m 

• 3 3 3—T3 33 

K = - KA - A K + KB R B K - C QC K(x) = H 

h = - A^ j h - KPf h(T) = 0 

^ 3 

where it is assumed that the pair (A^jC ) is observable and the pair 
3 3 

(A B ) is controllable. The difficulties with this approximation of 

problem (4.32), (4.33) include: 

31 

1. The error vector, e (t) , which includes truncated states and 

external disturbances must be reliably predicted over the entire con- 

31 

trol interval. (If c (t) is considered as only an external disturb- 

31 

ance then periodic updates in P and f may be helpful.) Thus, e (t) 
can be quite sensitive to the system (and model ^^) initial conditions, 
see (4.24b). 

2. The control problem for the larger model, may have to be 

31 11 

solved to obtain a suitable e (t). (See Rodgers <ind Sworder for a 

method to obtain a smaller model by solving the control problem for 

both the small and large models and minimizing the difference.) 

31 

We would welcome now a scheme for con?)uting e (t) without having 
31 

to predict e (t) accurately for all time, tC [0,T]. We wish, for 

example, to allow for changes in parameter values from time to time, 

changes in the nature of external disturbances, and changes in the 

initial conditions of truncated modes (see (4.25) for the way in which 

31 

these affect the actual model error vector, e (t)). The a priori 
/\31 

specification of e (t) ,sV t C [0,T], as presented in the "open loop" 
method above, precludes such flexibility. The fundamental difference 
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in the approach of the next section is the recognition of the fact that, 
as far as the construction of a state estimator is concerned, predic- 
tion of model errors only over a short time into the future is of 

interest. This fact allows us the freedom of modeling only those 
31 

modes in e (t) which appear to be present in a preselected (short) 
interval of time, x « T. Naturally, it follows that the choice for x 
is an important parameter in the model. By relating x to certain 
properties of the control problem a (modest) mathematical relationship 
is established between the modeling problem and the control problem. 

Now soon we will restrict our attention to the class of functions, 
e^^(t), which are d^^ differentiable, because we wish to approximate 
e^^(t) as the output of a d*"^ order differential equation. Hence the 
existence of e'* (t) which was guaranteed for unrestricted functions of 
time, does not carry over to the fionctions e^^(t) which are restricted 
(to be differentiable). However, in many practical situations the 
physical system is sufficiently well behaved to consider only (linear) 
differential equation models of all effects. In the following section, 
however, we do not yet restrict ourselves to differential equations to 
describe the synthetic "modes" assigned to the error system. 

4.2.2 Model Error Compensation by Closed Loop Signals 

In this section we propose an approximation of the model error 
vector 

e^^ = Pf (4.40) 

which allows the coefficients, ^he preselected function f^(t), 

i = l,2,...d, to change with time (P must change Slowly compared to a 
preselected time interval, x, where X will soon be defined). Hie 
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raatrix P is defined similarly to (4.37) except that T is replaced by T 


= J' g(t)e^^(t)f^(t)dt 


T 

/ 


g(t)f(t)f^(t)dt 


-1 


(4.41) 


Now construct a differential equation which f(t) obeys. Find a matrix 
D such that f(t) is an eigenfunction of the system, 


Y = Dy 


(4.42) 


Now replace f(t) in (4.40) by a set of "synthetic" variables y(t) C R 
and write for the error vector approximation the "error system" 


e^^ = PY(t) 

Y = dy 

where y(.t) in (4.43) is any solution of (4.44). Then 

^31 . 

e = Pe Y(t ) 

o 


(4.43) 

(4.44) 

(4.45) 


where the eigenmodes of the matrix D represent the synthetic modes 

31 

assumed to be present in e (t) , over an interval t . This construction 

of the error system relieves us of the responsibility of predicting 
31 

e (t) for all future time, tC [0,T], as required in (4.37). The task 

31 

of "assuming modes" which characterize e (t) only over short intervals 

is a far simpler (realizable) task. 

If Y (see (4.45)) can be updated in real time by state esti- 

/s31 

mation of the error subsystem (4.43), (4.44) then the estimate e (t) 
will not be a function of time which is fixed a priori as in the case 
of Section 4.2. In this way another alternative approximation to 
problem (4.32), (4.33) is formed; Minimize 


J( 




Qy^ + u° Ru°J dt + y^(T)^Hy^(T) (4.46a) 
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subject to the model 

A. 


.2 .2 2 . ^2 o 

X = A X + B u 


C 


(4.46b) 


where 


2 A / X' 
Y 


X 


P(T,d) 
0 D(T,d) 


= 


A“ = 

= [C^.O], = rV.O] 

>3 ^3 


X : n^- vector of physical 
variables 


y: d - vector of synthetic 
(model error) vari- 
2 ables 

X : + d = U 2 - vector 

3 3 

again observability of (A'*,C’) and controllability of (A ,B ) are 
assumed and where P(T,d) is found by (4.41) , and y(t) is to be esti- 

3 

mated along with x (t) in real time. The current raOdel error vector 

estimate will be e^^ = PY(t). The solution to this problem is given 

** 

by 


o „-l „2 „2 2 

u = -R B K X 


(4.47) 


where K satisfies the matrix Riccati equation 

T T 

•2 22 22 22-122 22 

K = -K A - A K + K B R B K - C QC 

T 

2 2 2 

K (T) = C H C 


(4.48) 


The order of the error system, d, and T, an effective "observation 
window" are as yet unspecified, 

Tne initial condition x (t^) = x^ in (4.46) is presumed known and 

Y(t ) is actually unknovm. However in the solution of the control 

P 3 

problem, (4.47), y(t ) is assumed known. An estimator for x (t), 

y(t) will be constructed in Section 4.5 which will converge quickly 
compared to t. The enforcing of this "separation principle" between 
the control and estimation problems must be done with care and more 
will be said about this in the sequel. 
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(In the next section a partitioned form of this equation will be 
examined. ) 

The addition of the "synthetic modes" (4.43), (4.44) to the model 

(4.1) to form the model ^2 (4.46) is. similar in spirit to the 

66 

"synthetic modes" Likins uses in the modeling of dynamical structures 

to account for incorrect coupling forces between the rigid body and 

flexible appendages when the model has been truncated by some other 

criteria (such as keeping "lowest frequency" modes). The method also 

has some conceptual similarities with the "assumed modes" and Rayleigh 

Ritz procedures for modeling dynamical structures p. 253, although 

in the present work the assumption of mode "shapes" in (4.43) is in 

time rather than in space as in Rayleigh Ritz procedures. One choice 

for the matrix D, for instance, is to "assume" some eigenvalues , 

i = l,...d of the matrix in (4.22) and write D = 

The eigenfunctions of this matrix then form the 

definition of the approximating functions, f, in (4.40). 

Even for a given dimension for such as might be imposed upon 

a state estimator synthesis, it is not always clear how many modes to 

3 

associate with the physical variables x and how many to allow as 
"synthetic modes" in y(t). The answer in a particular problem will 
depend upon the f^(t; chosen and the uncertainties associated with the 

model 

For example, suppose the object to be controlled is a free dynami- 
cal structure (such as a spacecraft) and there is some uncertainty in 
the (small) modal damping assigned to a particular mode in the vibration 
equations. Then if the frequency of vibration, w, is reliably known 
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the mode in question might be truncated from the physical variables in 
the model (delated f rora and considered in the synthetic variables 

Y^(t) of the error system (4.43), (4,44) by choosing 

0 03 

D = (4.49) 

“03 0 

3 

The resulting control poliicy (4.47) (where x y. Is replaced by the 
/n3 '' 

estimates x ,y from the state estimator to be designed in a later 
section) , can lend to the closed loop system an insensitivity to the 
modal damping (and even occasional changes in the damping) that is not 
enjoyed by the closed loop system in which the mode together with its 
assumed damping) is not triincated from but included in the model 
If the frequency of vibration and the damping are both uncertain then 
it might be more advantageous to use the error systems of Section 5.0. 
This is possible because the amplitude of the assumed oscillation is 
a function of the initial conditions of the synthetic variables (solv- 
ing (4.43), (4.44) with (4.48)) 

e = C. (y ) sin wt + C. (y )cos wt 
1 o / o 

Thus, the use of state estimators to update y^ allows an effective 
"tracking” of the actual amplitude of oscillation, assuming, in this 
example, that the damping is small so that the amplitudes change 
slowly, compared to the T which we must specify in Section 4.4. 

Now that some aspects of my approach to control by model error 
estimation have emerged, let us pause to clearly state the questions 
that are before us and see what tasks lie ahead. 
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1. We wish to model the error system by 


e = P(T,d)Y 
Y = D(x,d)Y 


Where do we get P,T,d, and D? 

2. We wish to solve the "control" problem 

o 2 2 A 

u = G X ^ \Y 

Where do we get G? 

2 ^ 

3. We wish to design a state estimator for estimating x for 

feedback. The estimator will have the form 

'^t ^t o o 

z =^ 22 ^ + d 2 ^z + Bu 

x^ = + T 2 z° 


Where do we get d 22 » ^ 2 ^, B, T 2 ? 


Now, the specification of G follows in Section 4.3. Away to 
determine T is provided in Section 4.4. The structure of the estimator 
is determined in Section 4.5 and selection of the parameters (d 22 »d 22 ^> 
B, T^,T 2 ) is discussed in Sections 4.6 and 4*7. Alternate choices of 
D are discussed in Section 5.0, together with special cases of the P 
selection method which was discussed in Section 4.2.2. Finally d, the 
order of the error system, is discussed in Section 5.2. 

4.3 The Control Problem (Specifying the Matrix G) 

In the context of this research exactly what is meant by "the 
control problem" deserves some clarification. In Figure 2-1, the 
"control problem" was described as the specification of the matrix G, 
and that is what we intend to do here. However, we must keep in mind 
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that the total "controller" design (see Figure 2-1) is not complete 
until the state estimator is also constructed. Furthermore, both the 
control law design and the. estimator design rely on a model, ^ 2 > 
as yet incomplete (the "observation window',', x, and the order of the 
error system, d, which influence the matrices (see (4.41)) P(x,d), 
D(x,d) have not been determined). We will not, then, complete our 
specification of the matrix, G, in this section but we can write the 
formula for G(P(5E,d), D(x,d)). 

From the approximation of the control problem (4.32) (4.33) which 


is given in (4.46), we can write immediately from (4.47) 

' 3 ^ 

u^^ = G ' ^ 


where 




-1 2 2 
G = - R 


(4.50) 


(4.51) 


THe Riccati equation (4.48) reduces to, upon substitutions from the 
definitions under (4.46b), 

(4.52) 


K = -KA^ - K + K - QC^ 


K(T) = H 


L = [k 


T T 
3 -1 3 3 

L = IKB R B - A 


j L - LD(X, 


d) - K P(X,d) 


L(T) = 0 


(4.53) 


F =-D^(x,d)F - F D(x,d) - P^(x,d)L - L'^P(x,d) (4.54) 

F(T) = 0 


where the n 2 ^ n 2 matrix K has been partitioned according to 
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(4.55) 


It is interesting to note the similarity of these equations (4.52)- 
(4.54) with those obtained from the singular perturbation approach, 


8 


where their "model contain synthetic modes but the modes 

of the un truncated <4> model. Note also that (4.52) describes the 
Riccati matrix for the "error free" model and can be solved inde- 
pendently of (4,53), (4.54). Given K, then (4.53) is linear in L, and 

moreover, given L and K (4.54) is linear in F. Finally, from (4.51), 

2 

together with the definition of B in (4.46), using (4.55), 




-1 3 


ly 


u = ^-R Kx ^ + 1-R B‘ 

= + u°(e) (4.56) 

Hence, the control consists of the sum of two parts, a term which is 
optimal for the error free systems, u' =(4). and a term which feeds back 
a measure of the model error variables, y(t) (recall e = Py) . Now the 


gain matrix G in (4.50) is given by 

T 

G = -r“^B^ [K,L] 


(4.57) 


Note that any model we write for a physical system can be considered 
a truncation from a better model. The point to be made above is that 
if one considers an model only slightly larger than the (truncated) 
model so that A is of the same order as the obtained above (by 
augmenting an assumed small order "error system" to the (truncated) 
model 4 ^ 3 ) , then the computational burdens of the two approaches (sing- 
ular perturbation and model error estimation augmentation) are the 
same. 
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There is some question of existence of G when T Since the 

3 3 3 3 

pairs (A ,B ) , (A ,C ) are respectively, cofttrollable and observable, 

the existence of a positive definite, symmetric K which satisfies 

• . 52 

(4.52) for K = 0 is guaranteed (see for instance, Anderson and Moore, 

for a discussion of Riccati equations). Therefore, we have only to 

show that L exists. Consider the defining equation for L, (4.53), 

L = -LD - [A^ + B^G^j L - KP L(T) = 0 (4.58) 

A -1 3 '^ 

where G^ = - R B K is the gain for the error free system. Now we 

show that L(0) exists if the error free system 

• 3 3 3 3 

X = [A-" + B-^G^] X 

is sufficiently stable. To show this, note that if L(t) has a steady 
state solution, L(0) , which satisfies 

0 = L(0)D - [A^ + B^G^] L(0) - KP (4.59) 

then the difference 

>Kt,T) = L(t,T) - L(0) (4.60) 

satisfies (differentiate (4.60), using (4,59), (4.58)) 

}'(t,T) = -(A^+ B^G^) ip(t,T) - l|;(t,T)D (4.61) 


where the notation L(t,t^) represents the solution of a linear matrix 

equation such as (4.58) , expressed by the matrix variation of con- 

40 

stants formular (see Brockett p. 59) 

L(t,t^) = $^(t,t^)L(tp ^>^(t,t^) -J* $^(t,a) K(a)P$^(t,a)da 


(4.62) 

1 3 3 2 

where $ is the state transition matrix for [-A - B G^J and $ is the 
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state transition matrix for [~D], Now, again applying the matrix 
variation of constants formula to (4.61) we have 


iJj(t.T) = $^(t,T) [L(T,T) - L(0)]^>^(t.T) 


(4.63) 


3 3 

or simply, for constant [A + B G], D, 


[a3+b3g ](T-t) T . 

L(t,T) = L(0) - e L(0) 


Thus L(0) exists only if lim L(t,T) exists which is guaranteed if 

X ->■ OO 

Re X.[D] + Re A.[A^+ B^Gj < 0 (4.64) 

1 J 

Vi C [l,d], j Q [1,0^]. Thus, the existence of L is guaranteed under 

condition (4.64). Now we turn to the task of defining the "observation 

window" T, used in (4.41), and the overall design of the state esti- 
''3 

mator for x , y. 

4.4 Specifying the Observation Window. T . 

In the operation of a state estimator (Kalman or Luenberger) three 
basic functions are performed: 1) the receiving of measurements, 2) 

the prediction of measurements, and 3) the feedback of the difference 
between the measurements and their predictions. The first function 
causally relates the estimates to the physical system. The second 
function relies upon the model of the system, built within the estima- 
tor, to instantaneously predict measurements, so that the estimate of 

The use of the word "prediction" is not here restricted to the sto- 
chastic estimation theory. Fpr instance, in the deterministic esti- 
mator, the quantity = MJ , where x^ is the output of the esti- 
mator, is called a prediction (instantaneous) of the measurement 
(based upon model^.). 
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the state vector can be changed (the third estimator function) if there 
is a difference between this prediction, = M'^x^ , and the real 
measurement, z°(t) . (The difference z° - is called the measurement 
residual, z'^ , see (4.29).) In other words, unless there is a change in 
the "initial conditions" of the physical system the state estimator will 
not change its estimate of the state assuming the model within the esti- 
mator is correct (so that the measurement prediction, M^x^(t), remains 
equal to the measurement, z°(t)). Now if one is concerned about con- 
structing a model upon which to base the design of the estimator, a 
logical question to ask is "Over what interval of time, T, should the 
model (predictions, z^ = M^x^(t)) be accurate?". That is, how long 
should the zero state response of the model match (to some prespecified 
accuracy) the zero state response physical system (see Figure 4-1 for 
the "experiment")? Let us first agree on two extremes. If this time, 

T, is "very long" then in some sense the model is "very good". In 
fact, if one performed some (off-line) performance analysis with this 
model, to determine some appropriate controls, one might, on the basis 
of the above experiment, decide to issue such controls to the physical 
system in an open loop manner (without further use of the model in an 
on-line operation, having previously verified by the experiment that 
the model predictions are "very good"). Furthermore, if one used this 
model to implement a state estimator for on-line feedback control, then 
under the same conditions of the experiment the measurement residual 
would (after an initial transient of the stable estimator) remain near 
zero, z° M^x^ , and the feedback signal would soon be based solely on 
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it 

the prediction of the model. On the other extreme, if the time, 
in the above experiment is very short and approaches zero then the 
model is arbitrarily "bad" in the sense that no model can be worse, 
since there is zero "correlation time" with the physical system. 
Intuitively, one would guess that for stability of the closed loop 
system the estimator (model) need only be accurate over time constants 
of the closed loop system. This discussion prompts two definitions 
which might be used as a characteristic property of the model. We 
define a property called a "characteristic time". 

Definition 4.1; Given a linearized time invariant model, of a 
physical system, 


x^ = x^ + u° 

Define a characteristic time, , associated with the model subh that 

3 

the zero state response of the model, z^gj^Ct) , satisfies 


| lz°(t) - ^ V5€[0 ,t^] 

if ||z°(0) - “ l |z°(0)|| ^ 

see Figure 4-1. This definition of characteristic time of a model is 

it 

Alternately, it can happen in Kalman filters, v7ith small or zero 
assumed plant noise (see Section 3) , that the weighting (Kalman gain) 
on the measurement residual goes to zero because -the estimator "thinks" 
it has learned the model and disconnects itself (K •> 0) from the 
(noisy) measurements, basing all future state estimates solely on the 
prediction of the model. The result, of oaurse, is filter divergence, 
since the model is never perfect. 

To make this definition practical in a deterministic sense some toler- 
ance must be accepted on the initial condition z°(0) of the physical 
system, since this cannot necessarily be controlled to absolute zero. 
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not very useful owing to its empirical nature and the fact that it is 
a function of the small parameters 6^, 6. It is, however, closer to 
the true property of the model (relative to the physical system) that 
is important to us in estimator design than is the more tractable 
definition which follov/s. Note that the above x is not a property 
solely of the model. That is, cannot be determined solely by 
examination of the model parameters (such as eigenvalues of the matrix 
A^). The following is a definition of T which is determined solely 
from parameters of the model and the control law. It will be taken as 
the definition of model characteristic time throughout the remainder of 
the report. 

Definition 4.2; Let V(x^(t)) be a Liapunov function for the asymptot- 
tically stable model of the closed loop system 




= A^x^ + B-^u°(x^) , 


yJ = G^x^ 


u°(x^) given 


(4.65) 


Then the quantity 



max 


V(x^(t)) 

-V(x^(t)) 


(4.66) 


i&ill be called a characteristic time of the model 

J 

This definition of a characteristic time was first suggested by 
Kalman, Reference 53. The proof that there exists such a x^ rests on the 
proof of the fact that there exists, for any system til which is asymp- 
totically stable, a scalar function v(x'^(t)) which is positive definite 
and for which v(x^(t)) is negative definite. Such a proof was provided 
by Messera, see Theorem 23 of Reference 54. It is another matter, 


71 



however, to find such a function. In the following discussion let us 
suppose there Is such a Liapunov function of quadratic form 


V(x^) 


.T , . 


(4.67) 


for the closed loop system model^^ which is linear. 


^3 


x^ = x^ 

y j 53 


(4.68) 


where 




O A 


state feedback gain matrix. 


We have tacitly assumed here that the state variables are available 
for feedback. This is tantamount to the assumption that the state 
estimator is perfect, an assumption which we will not tolerate else- 
where but in this definition of X. Now we cite the useful 
Theorem 4.1 ; (Using some results in Reference 40 p. 128). Let the 
closed loop system (4.65) have the Liapunov function (4.67) which 
satisfies the Liapunov theorem for asymptotic stability (Theorem 1 of 
Reference 53). Then V(x^) is a negative definite function of and 

the model characteristic time, T, defined by (4.66), is given by the 

- —I • *T ' 

spectral radius of the matrix [R^ ] , where A^ A . 

Proof; Extreme values of the scalar function 


T(x^) 


V(x^ (t)) 
-V(x^ (t)) 


(4.69) 


must satisfy 


3x(xj) ^ Q 
3x^ 


(4.70) 
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From (4.67), (4.68), (4.69), 


•T - .T . .T 

3t(x^) _ (K^ 4- ) x^ (R^ 4- ) j j j _ 

^ j “ .f . . “ .T . . 2 ^ ^ ^ ~ 

x^ rJ x^ (x^ rJ x^) 


(4.71) 


1 * * *i ^ * 

multiplying by — x'^ R^x^ , we have, for asymmetric , 

.T . , . 

x^ [K^ - t(x^)R^] = 0 


(4.72) 


.-1 . 


which implies that t(x^) must be an eigenvalu-:; of the matrix R^ . 


Hence 


= max t(x^) « X [R^ ] 

A max 

x*J 

= r[R^ ^ K^] 


(4.73) 


where [*] denotes maximum eigenvalue (magnitude) of [’]» sometimes 

called the spectral radius of [• ] , denoted r[*]. 

Corollary ; If V(x^) is chosen for the system (4.65), as 


V(x^) - jT 


.T . T 
(yJ Q y^ + u*^ R u°) dt 


and u°(x^) is selected as 

u°(x^) = -R~^ x^ 


(4.74) 


(4.75) 


where is the symmetric positive definite solution of 
0 = -K^A^ - r"^B^ Q 


(4.76) 


and where Q and R are positive definite matrices and the pairs (A"^ jB-*) , 
i i 

(A ,C ) are respectively controllable and observable, then the matrix 


A in (4.68) becomes 


= A^ - R ^B^ 


73 



and (4.73) becomes, using (4.77), (4.76), 

= A 


max 


— — — — ^ 9 

Jr* 1 -^ * *1 

1 - r” Q J I 


(4.78) 


where we have used the fact that (4.74) may be expressed V(x^) - 

.T . . 

1 1 i 

X K X ; see p. 277 of Reference 16 for proof of this equivalent 

expression for V(x^). The minus sign in (4.78) can be neglected since 

A was defined to mean the eigenvalue with maximum magnitude, 
msx 

A = la + jO)! . The ’’observation window", x, needed for the model 
max ' inr-u- > 


' max 


in (4.46b) for the estimator design is selected as 


A 3 

T = X 


3 

where x is defined by (4.78) with j = 3. In this way the time con- 
stants of the truncated model, ^re used to influence the error 
syste)jis (4.43), (4.44) which is augmented to define is to 

be used for the design of the structure (order) of the controller 
(estimator and control law)). 

4.5 Design of the State Estimator and Special Cases 

The problem to be solved in this section is the problem imposed by 

the decision to implement a control policy which requires all n. state 

n . ^ 

variables of the model (such as u° = Gx^ , x^ C R ^) when there are 

o * 

only Z measurements available, z (t) , z C R . The relationship of the 
resulting design of a minimal order (n^ - Z) state estimator wl';h the 
standard Luenberger and Kalman estimators is discussed. In particular, 
the unifying approach of carrying aSbong "model error vectors" in the 
design allows one to later consider the errors to be either determin- 
istic or random (white noise) and to obtain, respectively, a Luenberger 
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observer for the augmented system^^^Cd^ Pl“s error system), or a Kalman 
filter in the limiting case of vanishing measurement noise. When the 
model is assumed perfectly accurate » then the standard 

•j 0 j 

Luenberger observer is obtained. In this way the control problem need 

not be considered either as "deterministic'' or "stochastic" until the 

selection of estimator gains is made, and moreover, the choice may be 

made under the consideration of both deterministic and stochastic 

characterizations of the error vectors. 

In section 4.1 we discussed the existence of a linear coordinate 

transformation which would transform the large state vector into 

3 

a vector of variables ^ in which x appeared as a partitioned sub- 

3 

vector of i where x were certain variables whose physical signifi- 
cance was known to us. Here we use the same transformation device. 

We have available to us certain measurements z°(t), which in model 
have been described approximately by z^ = H^x^. 

We wish to obtain an approximation of the vector x^ which has a 
larger dimension than z°. Consider therefore a nonsingular coordinate 
transformation 


x^ = T ^ 

such that if x^ obeys 


x^ = A'^ x^ + u° + W 
z^ = x^ 


yJ = CJ 


n . 

x^ C R ^ 
z^ C R^ 
u°G r"* 


(4.79) 


(4.80) 


then i obeys 
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(4.81) 


i » A^T i + T“^ u° + w 
gj » M^T i 
yj = c^T i 

Furt-henuore , let T be that special transformation which reveals the 
measurement variables, , as explicit state variables (say the first 
V of them) of the transformed system (4.81). Then 

C , 0 < < Z 

^ n.-Z (4.82) 

z*^ C R ^ 



where is an (n . - Z') vector of ''trunsated measurements" in the 
J 

sense that they are not directly measurable from the system (4.80), as 
are z^ . However, since we have available (given T) the differential 
equation (4.81) which 'z^ obeys we suspect that under certain conditions 
we can, by integrating the differential equation in z^, obtain an 
approximation of z*"(t). In fact, since we have the differential equa- 
tion for z^, the only question is whether we can recover the initial 
conditions, z*'(0). Now in the transformed coordinates of (4.81) we 


have 


z^ = dj^j^ z^ + d^2 ^ 

z = z^ + d^2 z^ + u° + 
j j . t 

y = c^ z + C2Z 
= Mj^z^ + M2Z^ 



where 


‘^ll ^12 


*^21 ^^22 


A ^-1 ^ 


(4.83) 
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[C^, C^] = C^T 

[M., M^] = M^T 




Recall that eigenvalues are preserved through a similarity transfer- 
raation. Thus the eigenvalues of T A^T are fixed a priori by the 
eigenvalues of A'^ . Now the fundamental principle upon which reduced 
order observers (state estimators) rely is that even though the 
eigenvalues of T "*■ A^T are fixed a priori, it is possible under certain 
conditions to select T such that the eigenvalues of a block diagonal 
submatrix within T ^ A^T may be arbitrarily specified. The conditions 
under which this is true and the relevance of this fact to the design 
of the state estimators are developed in what follows. To simplify 
our immediate task we will only consider the case jl’ = £. It turns 
out that V = % yields a minimal order state estimator. Circumstances 
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when one might not wish to design the minimal order estimator are 
discussed in the sequel. 

Two requirements on the transformation T are immediate for non 
singularity. From (4.84), 


r 
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(4.85) 
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and from (4.82), 

(4 

.79), 

and 

(4 

A 

.84) 








(4.87) 


For convenience define, T^, T hy 


4 


11 


•21 




12 


‘22 


n.q 


r r 

r = ^21 ’ 22 




Then 


The notation [*]., means that the matrix [•] has dimensions r by k. 

jK 
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Tj T2 

r A^ r A^ 


Also (4.85), (4.86) can be expressed in the form 


T, T- 


In 0 

1 2 


% 

FT, r T- 

= 

0 I 

1 2 


q_ 


T t"^ = T, + T- r = I 
1 L n. 

J 


(4.88) 


(4.89a) 

(4.89b) 


Let us consider (4.79) as the output relationship for the transformed 
system 


i = T"^ A^ T i + T ^ u° 
= T ^ 


(4.90) 


Now if we integrate (4.90) for any given u°(t) we have (t) exactly, 
assuming J (0) = T ^ x^(0) , because (4.90) is just a transformation 
of the defining relationship for x^ ( J ) , namely (4.80). On the 
other hand, suppose we integrate only the z^ equation in (4.90), 
shown more specifically in (4.83). Now, instead of taking 2 ? (which 
is needed to solve for z (t) in (4.83)) as the integral of the first 
equation in (4.83), suppose we take instead the measurements available 
from the physical system, z°(t), and construct (define) the estimate 
as follows 
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(4.91) 


■= ^22 ’*21 *• ®2 
z° + 

In order to see how closely x^(t) approximates x^ (t) , which is defined 
by ( 4 . 80 ) (or equivalently and more conveniently by ( 4 . 90 ) with 
i ( 0 ) = x ^( 0 )), let us rewrite ( 4 . 90 ) relating (that measure- 

ment which is predicted on the basis of the model ^^) to the actual 
measurement z°(t), as follows 



where z* is a measurement residual and is present because of errors in 
the model Then (4.90) becomes 


.0 .ii j / o .wjv , j t , „ o 
z - z"* = ^ ^ u 

x^ = 'z^) + Z^ 


(4.92) 


Defining the estimator error vector 

A z^ - (4.93) 

and the estimation error vector 

x^ ^ x^ - x^ (4.94) 

it is straightforward to show, by differentiating (4.93), using (4.91), 
(4.92) and (4.88), that *z^(t) obeys 

z^ = d^2 ^ 21 " TA^T^z^ - TA^T^'z^ (4.95) 

and likewise for "x*^ 
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(4.96) 


= t 2 r i 

Note also that 

z*" = r 2C^ (4.97) 

The requirements for perfect estimation and asymptotic estimation 
are now before us. If we view (4.97) as the output of the system 
(4.96) we can see that if the pair (T^r , F) is observable, then the 
event 'z*"(t) = 0 guarantees that x^ (t) = 0. Therefore, we conclude that 
the estimate of x^(t) generated by (4.91) can be made to aj^proach x^ (t) 
arbitrarily fast if a transformation matrix T can be found such that 

1. T is nonsingnlar ((4.89) is satisfied) 

2. The eigenvalues of the submatrix d ^2 4 ^ 

arbitrarily specified 

3. The measurements appear as state variables in the transformed 
equations (that is, (4.109) is satisfied) 

and the final requirement is that the model is perfect in the 
sense that z’'^ ^ z° - z^ =0. These conditions are further summarized 
in the 

Definition: The ^)^^ order system of (4.91) is said to be a 

state estimator for the model (4.80) if and only if 

1) 'z^ = 0 (the model is perfect) 

2) r is selected to satisfy (4.89) with the additional require- 
ment that d 22 4 ^ ^^^2 is a stability matrix. 

To show that these requirements for an estimator are the same as those 
of Luenberger, multiply the equation in (4.89b) by FA^ and compare 
with Equation (5.5a) in Reference 55. Note also that the requirement 
4), p. 275 of Kwatny, is equivalent to (4.89b) but the additional 
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requirement, 2), of Kwatny is unnecessary since 2) can be obtained 
by multiplying 4) by F . Now following a suggestion of Reference 
57, p. 606, to make F^^ in (4.87) nonsingular, we shall eliminate some 
of the unknowns in (4.85) to facilitate design of the estimator. 


Suppose the state variables of (4.80) are arranged so that F^^ in 
(4.87) is nonsingular. Then from (4.85) it follows that T^^ and 




(4.100) 


where , 


F 

F 


21 " " ^22 
22 “ "^22 


T F 
21 11 
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After some manipulation it is possible to write the neater form 



(4.101) 


where 




- I 2 (4.103) 

q 

(4.104a) 
(4.104b) 
(4 . 104c) 
(4.104d) 
(4.104e) 


Now, 1^2 serves only as a transformation of estimator coordinates, as 
seen by noting that the eigenvalues of d ^2 the eigenvalues of 
[F„+ T„. G_]. Since the eigenvalues of d _ (and of the estimator; 
see (4.91)) are independent of the required nonsingular matrix "^ 22 * 
we may without loss of generality set, hereafter. 


83 



T = I 
22 


and write 



“12 



M^A^L2 

_*^21 

^22 



V ■^ 21«2 


(4.105) 


By a fundamental theorem of linear system theory, the matrix 

^2 ^ ^21 ^2 arbitrary spectrum if and only if the pair (F^, G^) 

T T 

is observable, or equivalently the pair (F 2 , G^) is controllable (see 
58 

Wonham , p. 660 for a proof). Also, the structure of F 2 and G^ is 
such that the pair (F 2 » G 2 ) is observable only if the pair (A^ , M^) 
is observable. (This can be shown by construction using the definition 
of observability) . 

4.6 Estimator Gain Selection on the Basis of Model 

, One approach for selecting the free parameters, l 2 ^ in the design 
of the estimator, (4.91), is to make the estimator stable with respect 
to model In this event the stability properties of the matrix 

d 22 ill (4.105) are of interest. Another approach is to relate T 2 j^ to 
the properties of the closed loop system using model This latter 

approach will be discussed in section 4.7. Presently, we proceed to 
select with reference to model (that is, select T 2 ^^ in (4.105) 
to make d 22 stable) . 

One approach for selecting the gains of a state estimator proposed 

3 

in the Appendix of Johnson, requires no transformation to canonical 
form as do the algorithms proposed in References 57, 59, and 60. 

The approach also leads to a state estimator which is equivalent to a 
Kalman filter in the limiting case of vanishing measurement noise. 
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In other words, the following completely deterministic design is also 

optimal for the linear stochastic problem driven by white Gaussian 

disturbances for some covariance of the plant. Furthermore, the 

relationships between the covariance of the plant noise and the 

parameters of the deterministic estimator design are given explicitly 

in (21) of Reference 56. One can therefore proceed to design any 

minimal order state estimator on the basis of a given covar— 

.timce matrix of the plant noise or, on the basis of specified stability 

properties of the estimator, whichever information/criterion seems 

more credible in a particular application. It also readily follows 

that any linear state estimator of order n^ is equivalent to a Kalman 

filter in the steady state for some plant white noise covariance 

matrix and some measurement noise covariance matrix (see Reference 61) . 

There are a number of the reduced order state estimator designs for 

62 

stochastic problems by Novak, such as those proposed, which allow 

the order of the estimator to be specified anywhere between n, and 

n.- 

J 


In general, the following conjecture seems appropriate. 

Conjecture: Any stable linear state estimator of order n. n.>n>n.- 5,. 

3 3 

is equivalent to a Kalman estimator for some given covariance matrix 

of the plant disturbance process and some given measurement covariance 

matrix of rank (£ + n - n^). The two limiting cases of this conjecture 

(n = n - Z, and n = n ) have been verified in the literature; see 
3 3 

Reference 56 and 61, respectively. Note that an A x £ measurement 
covariance matrix of rank r < £ means that £ - r measurements (or 
linear combinations thereof) are deterministic (known exactly) . We 
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have now supporting arguments for a previous assertion of section 4.5, 
that a linear control problem need not be viewed as either determinstic 
or stochastic until the selection of the state estimator gains must be 
accomplished . 

A matrix T 22 which will make d 22 (4.105) a stability matrix 

may be obtained as follows. Consider the optimal control problem 

T X 

min / (p p + 03 oj)dt 

03 •'0 

subject to 

• T T 

? = ?2 4 + gJ 03 

P = 

Then if the pairs (F 2 , G 2 ) , ( Q^) are both observable, the 

matrix for the closed loop system 

^ " K " ^2 ^2 (4.106) 

has an arbitrary spectrum, where K is the symmetric positive definite 
solution of 

0 = - K F 2 - F 2 K + K G 2 R“^ G 2 K - Qg Qg (4.107) 

Therefore the choice 

X = - K G^ (4.108) 

21 2 g 

yields a stable estimator (4.91) for any positive definite matrix R^. 
(Compare the matrix in (4.106) with d 22 in (4.105) and recall that a 
matrix and its transpose share the same spectrum). Furthermore, if 

*Recall that observability of the pair (F , G^) is equivalent to 

T T ^ ^ 

controllability of the pair ( F 2 » G 2 ) • 
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the estimator is to have time constants smaller than the "observation 
window," T, specified in section A. 4, this can be accomplished by sub- 
stituting [F^ + l/'T I] for F^ in (4.107). 

By using some results from Kwatny^^ it can be shown that the 
procedure described above (4.107), (4.108) for selecting estimator 
gains yields an estimator which is also optimal in the stochastic 
sense with some restrictions: The (deterministic) estimator designed 

above is also a Kalman filter in the case of vanishing measurement 
noise. Kwatny shows the explicit relationships between the plant 
noise covariance matrix assumed for the stochastic problem and the 
deterministic design parameters such as R^, in (4.107), (4.108). We 
may therefore fix the estimator gains on the basis of either determin- 
istic considerations (stability of estimator) or stochastic considera- 
tions (minimum variance estimator) simply by making an appropriate 
choice for and in (4.107, (4.108). 

To model the behavior of the closed loop system assume that the 
control law is a linear combination of the estimates of the states of 
model 4^2 • 


= G S \t) 


(4.109) 


On the basis of the state estimator designed, at least in structure, 
in section 4.6 we have from (4 . 91) , 


state 

estimator' 


= ^22 -H d2, + B2 G(T^ i T2 z") 

x^ = z° + T 2 2^ 


(4.110) 


Now, therefore, the dynamical system which is composed of the control 
law (4.109) together with the state estimator (4.110) is called the 
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"controller" and is given by. 


controller 


+ [d^^+ 

u®(t) = GT^ z° + GT^ 


Hie controller receives as inputs the physical measurements, z°) (t) , 
and yields as its outputs the controls u°(t) which are to be applied 


to the physical system. 


To evaluate a candidate controller design one could attach this 
controller to the physical system which actually receives u°(t) and 
generates z°(t). Such an experiment with the physical system might be 
economically prohibitive. Suppose for Instance, that the gravitational 
environment on the earth's surface precluded ground test of a fine 
pointing system of a spacecraft; or that a mass <:ransit control policy 
evaluation Imposes unacceptable risk of life and property. In such 
cases evaluations are performed on the basis of mathematical models of 
the physical system rather than the physical system itself u Such a 
prediction of physical system performance on the basis of a mathemati- 
cal model which has be.vm programmed on a (digital or analog) computer 
is called a "simulation." It is really this set of circumstances 
which provides motivation in this research to introduce "multimodel" 


designs. By multimodel designs we mean that selection of design 
parameters and controller structure is not based upon the same 
model. The actual control law design might be (wisely) based upon one 
model, call it whereas the estimator gains might be chosen on the 

basis of still another (better) model, which might be used for 

evaluation of the total controller design. The simpler model 
serves to fix the structure of the controller (i.e., order of 
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6 stimator) whereas the information available in a better model, 4 ^^ 
serves to make the state estimates (of that simpler model) more accurate. 

To see if this is practical let us construct the closed loop 
system with model using the controller (4.111), which was designed 
on the basis of model model 


.1 .1 1 ^ -1 o 1 

x=A x+B u+w 




fc^l 1 

ir 1 ^ 

/ Lm . 


(4.112) 


the controller (4.111), to obtain 
r 




a" 

bVj 

/ l\ 

1 ^ 1 

+ 

B^GT 

z° + 

I 

0 

B^GT^ 

1 ti 
\z / 

d2i«2GTi 


0 

L _ 


w 


0 


(4.113) 


Now recall the definition in (4.29) to write 




M 


Substituting this in to (4.113) yields, using also (4.29), 


✓ 

B^GT^ 

■^21^ V\ 


d22+ B 2 GT 2 


B^GT, 




(4.114) 




w 


0 

0 



(4.115) 
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It proves convenient to transform these coordinates to the coordinates 

( x^j where Is defined by (4.82). Accomplish this by using (4.93), 

f] 


(4.94) and (4.97) to get the simpler form for (4.115) 



(4.116) 


Notice that the system (4.116) is driven by the disturbance w^ 

assumed for model and the measurement residual ^ - z^, 

2 

associated with model and, in addition, the state x (t) and 

2 Q 2 

measurement residual ^ z - z , associated with model 

1 

general, both models are inexact and “z (t) $ 0 and 'z (t) $ 0. The 
^1 ^2 

residuals "z , z , may, in fact, be characterized as random processes 
which might have been Ignored in the control problem but which now 
are to be considered in establishing estimator gains. Other Important 
observations concerning (4.116) are to be made in the special cases 
which follow. 

Suppose in (4.116) it happens that 

Tj^M^X^(t) + r X^(t) = X^(t) (4.117) 

then (4.116) reduces to 
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The nontrivial conditions under tmich (4.117) holds is that the 
("evaluation") model is identical to the ("design") model 
That is (A^,B^,C^,M^) = (A^,B^,C^,M^) , since in that event (4.117) 
becomes 

[T^M^ + T^r - I] x^(t) = 0 

which is guaranteed by the estimator design criteria (4.89b) repeated 
here, (which ensured nonsingularity of the transformation T in (4.79)) 
TiM^ + T^V = 1 

Now in this event where ^1 and identical, (4.118) is 

rewritten 



21 

where % (t) represents those effects not Included (modeled) in the 

model error vector e(t) which became an integral part of the model 
see (4.46b). From the definition (4.26), 

e^^(t) = e(t) + e^(t) 
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Note that (4.119) Is stable if and only if the estimator is stable and 
the closed loop system is stable. 

It is instructive to write the solution for the linear system 
(4.119) , 


+ y^(t) 


y°(t) = [C^ 0] -$(t,t^) ^tj+ J$(t, 


a) 


I 

Loj 


U21. 

e + 


B^GT, 


-d 


21 


^2 

z 


da> 


where for the constant linear system $(t,cf) is defined by 


$(t,a) = exp 


2 2 
+ B G 


-B^GT, 


22 J 


(t-a) 


(4.120) 


(4.121) 


the partitioned parts of which are 

$^^(t,0) = l"^ |si-[a^+ b^g]| 

J 


-1 


$^2(t,0) = L 


I sI-[A^+ B^G]| " [B^GTj[sI- d„]"^ 


$2]^(t,0) = 0 


$22(t,0) = L"^[sI- d22l"^ 

where L ^ denotes inverse Laplace transform and s denotes the complex 
variable. Using these partitioned forms rewrite (4.120) as 

.t 


(t) = $„(t,0) x;; + $^2 




^‘1 \ 

Jq 


C^0^^(t,a)e^^ da 


§^^(t,a)B^GT^ - $^2<t,cj)d2Jz^da + 3r^(t) 


( 4 . 122 ) 


Now we note that the zero state response of the output with respect to 


r«21 

the relative model errors in the plant, e , is independent of 
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estimator dynamics. Thus, these errors are transfered to the output 
via the transfer functions 


y\(s) 

^21 f . 

e ^(s) 


|sl- (A^+ B^G)] 


Ij 


(4.123) 


Likewise it can be shown by taking the Laplace transform of (4.122) 

2 

that the measurement prediction errors, 'z , propagate to the output, 
y°, according to the transfer functions 

(4.124) 
ij 

which transfer function possesses roots associated with the optimal 
control problem (due to the first term in (4.124)) and certain addi- 
tional eigenvalues due to the estimator (but not equal to those of the 

estimator) . Further insight is obtained by substituting the specific 

2 2 2 

choices of the matrices A , B , C , and G defined by (4.46b) and 



(4.57). 

The closed loop system matrix from the control problem is, using 
(4.46b) , 


2 2 
A + B G - 


A^ P 
0 D 


[Gi G^] 


3 3 3 

A + B G^ P + B G 2 


0 


D 


2 3 

then since C = [C ,0], (4.123) becomes 


!>L 

/^21 f s. 

e j (s) 


= {C^[sI-(A^+ 


-1 


-1 


I, + (P + B-"G2)(sI-D) 


(4.125) 


(4 . 126) 
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Now the relative error vector e has the structure 


«w2il 21 

where e ^ and respectively force the plant and the error 
system which is augmented to model (The composite system is 


called model . 


•3 a 3 3 ^ o ^ 31 

X = Ax + B u + e 


But from (4.26) 

31 ^31 ^ ^31 f. 

e - e + e (4 

/v31 

Now the particular choice for e in section 4.2 was from (4.43), 


(4.44) 


31 ’’ 

e^^ = Py 

Y = Dy 

Any errors associated with model tin have the form 



since 


(D 


and (4.128) becomes 


“■f) : :C)*b 

L J \ 


where But from the definitions (4.27) then the error in 

the output (4.43) of the system (4.44) is 
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A gD(t-a) ^ ^1 (4.133) 

^ Jq ^ 

Now from (4.126), (4.131) and (4.133) we have 

y°(s) = C^[sl- (A^+ B^G^)] [i + (P+B^G^)] [sI-D]"^^(s) (4.134) 


The characteristic equation for this transfer function will have the 

roots of the optimally controlled "error free" system and in 

addition, the roots of the error system. The estimator dynamics do 

^21 

not enter into the transfer of relative errors, 62 , of the model 

error system, e = Py ,Y = By, to the output y , at least as far as 
the zero state response is concerned. 

2 2 2 

Now consider the substitution of the A , B , C definitions into 
(4,124). The result is 

y®(s) = C^[sl- (a\b^G^)J B^ Gjr^-T 2 (sI-a 22 )"^a 2 ^]'z^(s) (4.135) 

which relation involves, again, the closed loop dynamics of the 
optimally controlled "error free" system and also the dynamics of 

the estimator (but not precisely the same eigenvalues) . 

21 

Errors in the state equations of the error system, ' 62 '^ , propa- 
gate to the output y° in a manner governed by the eigenvalues of the 
optimal "error free" system and the eigenvalues of the assumed 

modes of the error system, see (4.134). The propagation of errors in 

~2 

measurement prediction, z , to the output is governed by the eigen- 
values of the optimal "error free" system the eigenvalues of 

the estimator, see (^.135). 
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Now the estimator gains generally should not be based exclusively 


upon considerations of either (4.134) or (4.135). The sum of the two, 
for given characterizations of 'z (s) and "e 2 (s) , could be considered. 
However, difficulties arise when the error system model “e = Py , y^Dy 


is unstable, a case that will prove desirable to consider in section 

5.0. In such cases the transfer functions y°(s) (s) are unstable. 

* 

One possible remedy for this situation is to choose the estimator 


gains in the best root mean square sense 


rain 

T 

21 


lira 


U’ 


y°(t)'^ Q y°(t)dt 


1/2 


where y®(t) is given by (4.120). Such a minimum exists if y°(t) is 
bounded. Another possibility is to select the estimator gains to 



y°(t)^ Q y°(t)dt 


For any finite T, much larger than T, the system characteristic time. 
4.7 Selection of Estimator Gains On the Basis of Model 


A more appealing choice for the gains of the estimator than the 
model ^2 referenced decisions of the previous section, is a choice 


After all, it is our own characterization of the error system that is 
unstable and not necessarily the actual "error system." Consider for 
example an actual error vector that is sinusoidal (resulting perhaps 
from a structural bending mode which was truncated from the design 
model) . If the characteristic time of the controlled system is small 
compared to the period of this sinusoid, it might be quite reasonable 
to model such an error vector as a polynomial in time, since the state 
estimator continually updates the initial conditions associated with 
the error system rapidly compared to a period of the sine wave. In 
this way a bounded function is (piecewise) modeled as the output of 
an unstable system. 
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which involves an evaluation model which is better than the 

control design model To appreciate this point consider that it 

is usually the effects not included in the model upon which the con- 
troller design is based, which serve to limit the overall performance 
of the physical system, (which is under the influence of such a 
controller) . The following procedure allows us to consider plant and 
measurement errors which were neglected in the previous modeling prob- 
lems (and therefore in the estimator design) . The flexibility offered 
by this "second chance" is as follows. To include all such (possibly 
correlated) model errors in the original statement of the problem 
causes the order of the estimator (and therefore of the controller) to 
be large. To neglect such disturbances all together limits the per- 
formance of the overall physical system plus controli, r, perhaps 
unacceptably. 

To illustrate this point consider a completely deterministic 

design. That is, the model errors defined for the model are completely 

deterministic, as is suggested in the previous sections of this 

chapter and in the "external disturbance accommodation" work of 
3 4 

Johnson. * But on the basis of the deterministic design there is no 
natural "bandwidth-limiting" mechanism. A requirement of asymptotic 
stability of the estimator design moves the eigenvalues "to the left 
of the imaginary axis" of the complex plane. The requirement that 
the estimator be faster than the system in the sense of placing eigen- 
values "to the left of a vertical line s = - 1/t," where T > 0 is a 
specified number (such as the characteristic time of the system, see 
definition (4.66)) are used by some (see section 4.0 of this paper 
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3 

and Johnson, page 20.) There are not, however, any known procedures 

in the deterministic theory which naturally force the eigenvalues to 

"lie to the right" of some line, thereby limiting the spectral radius 

(r[Al A X [A] of the closed loop system matrix A, and limiting the 
== max 

amplification of system modeling errors.) In an engineering sense it 
is precisely the feature of the stochastic optimization theory which 
limits the spectral radius of the estimator (actually, by specifying 
the poles uniquely, in the case of the Kalman filter) , that is so 
helpful. It is not the lack of uniqueness in the gains of the 
Luengerger observer that invites poor pole placement, but the lack of 
upper and lower bounds. The reason of course, that they are lacking 
is that there is nothing in the model itself to warn of the limits of 
its fidelity. There is associated with each model, , a finite 
region in the complex plane, , whose shape (and even connectedness) 
is unknown, but which represents a region of "model credibility" (or 
"spectrum of control authority") in the sense that closed loop system 
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poles placed outside this region cause the closed loop system 
performance to "disagree" with that performance predicted on the basis 
of the model . Such a set, is a "fuzzy" set in the sense that the 
boundaries cannot be precisely defined. Moreover the information to 
establish the boundaries of is not continued within the model itself. 
These various reasons prompt us to investigate the possibility of 
designing controllers on the basis of more than one model. These 
"multi-model controller" designs have the advantage of the lower order 
that the minimal order Luenberger observes affords, together with the 
band limiting qualities afforded by the consideration of a second and 
better model. 

Returning to the task of estimator gain evaluation on the basis 
of model the equation of interest is (4.115), rewritten here in 

the compact form, 

X = Ax + z^ + G^w^ (4.136) 

o .1 

y = cx + y 

111 

where we may take X , *z , w , y to be either random or deterministic 

o 

in character, and where 



C A [C , 0] 



99 



We wish now to solve the problem 


^^21^ 




(t) Q y°(t)dt 


(4.137) 


for finite T (and also consider the limit as T ^o) . Now if we 

consider to be the deterministic term forcing the linear plant 

and G w the zero mean white noise randor:. process whose covariance is 
r r 

known, and y is a zero mean white noise random process whose covar- 
iance is known, then we can write (4.136) as 


x = Ax + G, Wj + GW 
d d r r 

O r> A. 

y = Cx + y 


(4.138) 


where the specific definition of and G^W^ will depend upon the 

choice of *z^ and w^ characterizations in (4.136). From Appendix B, 


Equation (B.IO) we can immediately write (4.137) as 


J*(T„.) = min tr C^^QC f ^ x^\dt (4.139) 

J0\ / 

where obeys 

E = + Ea’^ + E 

(4.140) 


and X is defined as 


.141) 

where $ is the state transition matrix for A, and satisfies 

$(t,a) = A $(t,a) ^>(o,a) = I 

Thus, it is not necessary to restrict the original model error vector 
21 

e (t) to a deterministic description. It is however, convenient to 


x(t) A 


/ 

Jn 


$(t,a)G^WjdcJ 
a a 


(4 
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do so, and no generality is lost when one considers that the estimator 

can be designed with full dimension (of order n^) or with reduced 

"l 

dimension (of order ^ ^ ~ that in the full dimensional 

case the gains may be selected for "optimality" in the stochastic 
sense, or for "stability" in the deterministic view. In the reduced 
order estimator the gains may still be selected for stochastic opti- 
mality in a limiting sense, corresponding to a Kalman filter with 

56 

vanishing measurement noise (see Kwatny ). Thus, it is convenient 
and not restrictive (given any constraints on the order of the esti- 
mator) to consider the deterministic design. In this view of the 
model error description, the results of this research can be useful 
to obtain a model, and a design, for systems subjected to correlated 
disturbances. The modeling of correlated processes is an underrated 
task. 

It is well appreciated in stochastic problems that the condi- 
tional expectation and more specifically the Kalman filter, is a 
"biased" estimator if there are correlated disturbances (or "model 
error vector") which have been modeled as white noisv?. It is common 
to provide the simplest Gauss Markov model for such a disturbance, a 
"bias" model (an unknown constant plus white noise, y - 0 + w(t)). 

* 

Assuming Gauss Markov models of correlated disturbances are appro- 
priately given by the deterministic structure given in (4. A3), 

(4.44) with white noise added to (4.44). 

** 

If only the first and second moments (the means and the covariance 
kernels) of the random processes, are known, then the Kalman filter 
is the best linear estimate for a large class of optimization cri- 
teria (including minimum variance) , and this is true regardless of 
the statistical properties of the random variables (see Theorem 5.3, 
p. 166 of Reference 38). 
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Recall from section 4.2.2 that the modes present in the actual model 
error vector e (t) which are important to model in the model error 
vector approximation, e(t), are those which "appear to be present in 
an interval x" (x is called in section 4 the characteristic time of 
the system). Therefore, the condition under which bias models are 
always sufficient as a characterization of a correlated disturbance is 
that the system characteristic time approach zero (corresponding to a 
controlled system with an infinite "bandwidth", to further abuse a 
common word in control engineering) . It is perhaps unfortunate that 
in the vocabulary of the stochastic control literature the words 
"biased" and ’'unbiased" are used to describe estimators, when in fact 
the difference between the mean of the state and the mean of the 
estimate of the state is not at all constant. Nor should one be led 
to believe that the "biased" estimator can be made "unbiased" by the 
incorporation of "bias" disturbance models in the estimator (filter) 
design problem. The next simplest class of models for error vectors 
is the polynomial model (of which class the bias model is a member) , 
and section 5.0 will treat this class of error vectors. The signifi- 
cance of that procedure in section 5.0 to stochastic problems is that 
the order of model for the correlated disturbance (error vector) is 
related to the characteristic time of the system (which time might 
have also been labeled the "correlation time") . This makes sense 
because white noise models are appropriate only for those processes 
whose correlation times are small compared to the system characteristic 
time. In this sense, section 5.0 can be viewed as a procedure to 
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determine that model of the system Including disturbance and 
measurement processes which can be "appropriately" driven by white 
noise . 

To summarize section 4.0 it is helpful to consider Figure 4-2 
where the controller design events are summarized. The hierarchy of 
models indicated in Figure 4-2 is utilized in the following way. The 
physical system is committed to a mathematical description, 4^^, 
which represents the physical system sufficiently closely to serve as 
a final evaluation model (i.e., via computer simulation) to select 
between candidate controller designs, (in the event the physical 
system, is not used for that purpose) . The model is a 
linearized model whose order exceeds that acceptable for on-line state 
estimation. 

Model ^2 ^ truncation of (whether actually constructed in 

this way or not) and some errors between and are noted (i.e. 

31 

the specification of e by "assumed modes" (4.43), (4.44)). A con- 
troller design on the basis of is likely to be sensitive to the 
modeling errors. Model is therefore introduced, and is composed 
of the model error system (4.43), (4.44) augmented to the truncated 
model The order of acceptable (actually by definition of 

^3’ ^2^ controller design, and the structure of the controller 
is established on the basis of the model Now, the gains of the 

estimator, T 21 * selected on the basis of model or, on the 

basis of the better model This latter "tri-model" design pro- 

cedure (refer to the "design triangle" in Figure 4.2) is more difficult 
computationally than conventional Kalman filter or Luenberger observer 
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Figure 4-2. A Tri-Modet Design Procedure. 




lOA 



designs (which are based upon a single system model which is assumed 
perfectly accurate) but has the following advantages. The reduced 
order Luenberger estimator has no natural "band limiting" features 
unless white noise is considered present in the plant, in which case 
the estimator design becomes a Kalman filter in the limiting case of 
vanishing measurement noise. By considering the tri-model design 
procedure the estimator gains can be influenced by the presence of 

both (white) noisy characterizations of relative model error vectors 

21 1 
S (which play the role of plant noise) and measurement residuals 'i 

(measurement noise). The following section 5.0 helps to further sys- 
tematize the choice of the model error system characterization (that 
is, the selection of P(T,d) and D(x,d)). 


* 

The eigenvalues can fall arbitrarily far to the left in the complex 
plane. 
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5.0 Orthogonal Filters for Model Error Estimation (OFFMEE) 

In section 4.2.2 It has been suggested that the approximation of 
the model error vector e(t) by e(t) be done with closed loop operation 
In mind so that state estimation can be applied to the differential 
equations the error Is assumed to obey. Thus, e was chosen as a least 
squares approximation to e(t) yielding the matrix of coefficients P 
g’vven by (4.41) and the matrix D as determined to satisfy (4.17), 
where the f^(t) are a selected set of approximating functions . In 
this section the f^(t) are selected to be orthogonal functions with 
respect to the weight g(t) on the interval [0,x], see (4.41). Toward 
this end we introduce differential equations (for the error system) 
which orthogonal functions are known to obey. We will restrict our 
attention to the class of Jacobi polynomials which include via change 
of variable, Fourier, Legendre, Chebyshev and other polynomials. 

Before specializing to these cases, it is helpful to first develop a 
model error system of the form 

e = Py 


(5.1) 


Y « DY 

which is capable of generating the more general Jacobi polynomials. 

It will be convenient to normalize time to vary over the interval 
[-1,1] instead of the interval (0 ,t]. Such a change of variable is 
accomplished by the definition, 

a A 2 - 1 (5.2) 

Jacobi polynomials J^'‘ *^“^(a) are determined by the property of being 
orthogonal on the interval -1 £ CT £ 1, with respect to the weight 


page blank nut 




cc 6 "* 

g(a) “ (1-a) (1 + a)^ and may be expressed in the form Reference 1, 


page 24, 


j (a,3)(^) „ (1 _ iL (5.3) 

^ dr 


Theorem 5.1: 

The model error system which yields the least squares approxima- 

C fV ft 

tion of e(a) to e(a) using Jacobi polynomials, f^(o) “ * '(a), 

can be written, in terms of the normalized time, a, 

e(a) = P y(c 7) 

= D(a) yia) 


(5.4) 


where 


YC R' 


zd 


g C R 


m 


■ ■ /.: 


g(a)e(a)f‘‘(a)da A' 


P = [P',0] 


D = 


0 


- diag. 


0 

2+a+6 


?. 


1- a' 


2(34a+g) 


1-0 


\ 


\ 


(d-1) (d+g+6) 
1-0 2 


-0+a+(g+3+2)o 

^d 


1-0 


2dx2d 
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where diag ( • ) means a diagonal matrix with the ordered elements of 
(•) on diagonal. This result, (5.4), is the first order form of the 
Sturm-Lionville, equations. Reference 62, page 60, and the eigen- 
functions of (5.4) are Jacobi polynomials, y^(^) “ f^(<^) = J^^^’^^(a). 
Now, depending upon the choice for the parameters (a, 3) the system 
(5.4) may generate Legendre polynomials (a = 3 = 0) , Chebyshev poly- 
nomials (a = 3 = - 1/2) and other polynomials of interest. It is also 
possible to select time invariant representations of (5.4) for the 
cases of interest which follow. 


5.1 Chebyshev Error Systems 

The special case of Theorem 5.1 when a = 3 yields ultraspherical 
polynomials of which class the Chebyshev polynomials are a member 
(specifically a = 3 = - 1/2) . The Chebyshev polynomials are of 
primary interest in the remainder of our study because they include 
(by change of variable) the Fourier series. They also provide a 
simple and systematic extension to the "bias'* models often used to 
model correlated disturbances in stochastic problems. Because of the 


good curve fitting properties of Chebyshev polynomials we introduce 
Theorem 5.2; 


Suppose the model error system is modeled by 
e(a) = P y(a) 

Y’(<y) = D y(a) 


(5.5) 
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P is the set of Chebyshev-Fourier coefficients. 


P “ 






g(a) e(a) f^(a)da 


7r 


1/2 

0 




d-i 


(5.6a) 


0 

1 

0 

3 


0 

0 

4 

0 


0 

0 

0 

6 


0 

0 

0 

0 


dxd 


(5.6b) 






(5.6c) 

(- 1/2 - 1 / 2 ) 

where * (a) A T^(a) are Chebyshev polynomials of the first 

kind of degree 1, 

—1 2 

T . (a) A cos(l cos a) , a A — t - 1 

1 — = T 

Then, any solution of (5.5) satisfies 

Ho) = P(Yq) T(a) (5.6d) 

where 

I 

P(y ) A P 
' 'o = 

(note: y(c = - 1) = y(t = 0)) 


T 

''o 

T 


■^0 Qi 


Yr 


Yo ^d 


(5,6e) 
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where 



The proof follows from theorems A6 and A7 in Appendix A. 

We wish to now interpret the performance of the state estimator 

X 

for Y(t), operating in the real time, t = (o + 1) • 

Corollary: 

Given the model 


A 


, .3 .3 3 ^ ^3 o 

|x = A X + B u 


3) 3 .3 3 

z = M X 


and the model A„ 


.3 .3 3 . ^3 o . 

X =Ax +Bu + e(t) 

2 3 3 

z = M X 


\ 


where 

e = P Y 
Y = D T 


(5.7a) 


(5.7b) 
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Proof: The proof follows immediately by construction of the solution 

of (5.7a), (5.7b), and the definitions in (4.29), and Theorem 5.2. 

On the basis of Theorem 5.2 and its corollary, it is possible to 

interpret the performance of the state estimator (whose construction 

is based on model ^he following way. Assume that e"^ (t) is a 

31 

very good approximation to e (t) so that, in terms of the definitions 
in (4.26), e^^ = 0. Then = 0 (by definition of e^^ in section 4.1), 
and (5.7d) Implies that the difference between the physical measure- 
ments z°(t) and those predicted by model is equal to a linear 
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combination of Chebyshev polynomials, and furthermore the coefficients 

of these polynomials are a function of the initial conditions, 

Recall from a discussion in section 4.5 that the presence of a nonzero 

0 3 

measurement residual, z - z , causes the estimator to change its esti- 
mate of the state vector from that predicted solely on the basis of 
the model. Therefore observe from these comments and (5.7d) that if 
the estimator is stable, then, in response to real measurements z°(t) 
the state estimator automatically selects those coefficients of 

Chebyshev polynomials which cause improvement in the estimate of the 

3 31 

state vector x (t) and the model error vector e (t) , such that those 

3 

measurements predicted by z (t) are closer to those from the physical 
system, z°(t). This is not "adaptive" curve fitting of the measure- 
ment residual in the sense of changing parameters within a fixed model 
structure, but an "adaptive" function has been served by increasing 

the order of the model with the error system. The Chebyshev polynomials 

31 

used to approximate the model error, e (t) , which are automatically 

selected by the state estimator for y(t) , yield a least squares approx- 

31 

iraation to the model error vector e (t) . Such an approximation of 
31 

e at any time, t, is based upon the present model error and the 
prediction of model error t units of time into the future. To see this 
prediction feature, note the estimator design of section 4.5 and note 
that the matrices P and D are functions of the prediction interval, T. 
This makes the inherent prediction feature of the estimator accurate 
only over the selected Interval T. Selection of x was related in 
section 4.4 to a property of the closed loop system called the charac- 
teristic time, see (4.66). 
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The estimator associated with the variables y(t) (noting that 

2 

X A (^ )) can be thought of as an "orthogonal filter" to estimate 
model error, since the model upon which that portion of the state 
estimator associated with y is based generates orthogonal functions. 

The particular error system given by (5.5) is called a Chebyshev 
error system. The design of "orthogonal filters" for model error esti- 
mation (OFFMEE) is summarized in the specific steps of the next section. 

Some further properties of the Chebyshev error system (5.7) need 
to be mentioned. The eigenvalues of D are 0 repeated d times, and 
this means that the error system model is unstable (see Appendix A) . 

The resulting closed loop system of a uni-model design is unstable. 

That is, the analysis or "evaluation" of the controller with respect 
to the same model, upon which the controller design is also based, 

indicates instability. To see this, refer to (4.11g) (further, 
assume the model is perfect (3?^, e^^, 'y^ all zero)). Then, using 
(4.125), conclude that a subset of the eigenvalues of the closed loop 
system (4.119) are those of D. Thus, on the basis of this "evaluation" 

of the controller, some components of the state trajectory 
2 3 

X (t) = (x (t)) approach infinity as t “ from some nonzero initial 
Y (t) 

conditions. Two observations are pertinent to this situation. The 
first has to do with the interpretations of this instability conclusion, 
and the second has to do with the alternate suggestion for design of 
the controller on the basis of more than one model of the system. 

The concept of stability imposes limiting time, t ■+• «>, arguments. 
That is, from a given set of initial conditions, the Initial conditions 
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must not change again V tClt^, «•) . However, in the construction of 
a model, we carefully noted that the model was capable of accurately 
portraying the output of the physical systems only for a relatively 
short time, and the particular time over which the model was argued to 
be credible was the characteristic time, t, defined in section 4.4. 

It was specifically argued in the model error system construction of 
section 4.0 that only modes of truncated states which appear to be 
present over an interval T need be modeled in the error system. For 
example, suppose an uncontrollable mode of were truncated from 
Let the mode have the effect of a damped sinusoidal forcing function 
on the state equations. Let the effect of this truncated mode be 
modeled (even exactly) over an interval x as a second order polyno- 
mial in time. Such modeling relies upon the state estimator to update 
the initial conditions associated with the error system, (for accurate 
modeling of the errors) but arguments of stability remove the (prac- 
tical) circumstance of initial condition changes (Vt^[t^,“)) asso- 
ciated with the assumed modes of the error system. Therefore, since 
the model of the dynamical system is credible only over a relatively 
small time, T, (from a fixed initial condition) designing for stabillty 
has less conceptual appeal than designing for performance of specific 
variables as discussed next. 

Suppose now, that all parameters are fixed by the controller 
design except the estimator gains, (see (4.110) and (4.105)). 
Section 4.6 shows how the estimator may be stabilized. However, the 
closed loop system, (4.115), (with respect to model may not be 

stable, from above arguments. If, instead of using in (4.115), 
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we "evaluate" the controller with respect to a different model, 

the system (4.115) might indicate stability, (i.e., any 
model for which GT^M^ is a stability matrix). Instead 

of using stability arguments to fix the estimator gains, one might 
choose to minimize the functional 


J(T 




y^ Qy dt 


(5.8) 


(y is the output of (4.136) with = jd ^ for finite T. Now again, 
this functional is considered for a set of fixed initial conditions, 
and in the limit as T ->■ «> we are faced with the same conclusions that 
troubled us with the stability arguments: For fixed initial conditions 

3(1^^^) might not provide a faithful prediction of the physical system 


performance for large T. Making T equal to a fairly small multiple 
of the control system's characteristic time, T, might provide an answer 
in closer agreement with the physical system, (i.e., T = 20t or 
T = IOt). It was mentioned early in this report that the performance 
of controlled physical systems is usually limited by those effects not 
modeled in the design of the control law. We may wish, therefore, to 
choose on the basis of (5.8) where y is the output of (4.136) (as 
written, in terms of model parameters) . 

The Case of Infinite Terminal Time and Chebyshev Error Systems . 

The defining equation for L, Equation (4.53), can be used (in 
the case of T = “) to produce a recursive solution for the elements 
of L as follows. Since for the Chebyshev error system the matrix D 
has a lower triangular form, the expression (from (4.53) with LEO 
corresponding to T = “ in (4.46a)), 
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T 

„3„3 -13 

r 

3 1 

KB R B - A 

J L 

L D + K Pj 


can be written 


133 - 13 ^ 3 ^ 

= Ik B-’z B - A-^ 


] Ilk f-kl 


where il > j and the range of j is j“l,2,.,.d. Thus, beginning with 
the d*"^ column of the matrix L, denoted by L*^, where 

L » [L^, L*^] , P = [P^,P^,...,P‘^] 

we have 

= [k^S^ “ 

due to the fact that the last column of D is zero, see (5.7c). Pro- 
ceeding to write the columns of L proceeding right to left among the 
columns of L, 


= [k\ 3 . /] ^^3j,d-2^ J_d 


- [k^S^ - A^’'] 

c 




d-l,d-2 


+ L 


^d,d-2) 


- A^‘J (kV-^ + L^-2 l"- Vl,a-3^ 


rd-A L 3_3 .3"! L3„d-4^ d-3^ ^ T‘i“2^ i xd'l 

L - LK S - A J |K P + L ^(j_3 d-4^ ^ Dj o A /+ L D 


T(. 

T‘(. 


,d-3) 


d-2,d-4 “ ‘'d-l,d-4 

°d,d-4) 


+ L 


A^T (k^p"-^+ g ""'S-k,d-3) 


, 3=0,1,. . . ,d-l 
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Thus, the above equations represent a solution for the columns of L 
beginning with as given, requiring only that the matrix D have a 
lower triangular form (with zero diagonal elements) . In the case of 
the Chebyshev error system the matrix D has certain other zero entries 
which may be omitted. 




^d-j+k,d-j^ 


d-j+k 


) 


j— 0,1,2, • . . ,d~l 


Lemma: 

If the square dxd matrix D has a lower triangular form, and the 
nxn matrix S is nonsingular then the equation 

• 0 



has the recursive solution which generates the d columns of L, 

d-j+k1 


= S" 




j— 0,1, • .d— 1 


beginning with the initial condition 

d -Id 
L = S F 


5.2 Summary of a Tri-Model Design 
Procedure using OFFMEE 

In section 4.0 it was suggested that the controller design 
involve more than one model. In section 5.0 it was suggested that the 
model upon which the controller structure is based include an error 
system such that the model errors are approximated with orthogonal 
functions. The design of the controller based upon this augmented 
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system involved the "orthogonal filter" which estimates (and therefore 
reduces the impact of) the effects of the model errors which include 
external disturbances, truncated modes, and parameter errors. In this 
section we outline the step by step procedure to accomplish these 
design tasks. 

Before outlining the steps it is helpful to review the total list 
of parameters which are to be determined, in order of their determin- 
ation. 



Parameters of the model jd. 


.3 ,3 3 ^3 o 

X = A X + B u 


(S)- 




M' 


weighting matrices for the optimal control problem 



3 3 3 

K(A ,B ,C ,Q,R) Riccati matrix for 4 ^^ 

3 3 

t(A ,B ,Q,R,K) Characteristic time of 

31 

e (t) assumed "mode shape" of model error vector 

D(x,d) Matrix for synthetic modes of error system 

P(T,d) Chebyshev-Fourier coefficients of error system 


L(P,D,K,A^,B^,R) =» G(R,B^,K,L) . 


Feedback gains for model error 
estimates 
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z measurement errors in model which are to be considerea in 

estimator gain determination 
^21 / 

e model errors in which are to be considered in estimator gain 
determination 

^21 ,M^ all of above) Estimator gains, selected on the basis 

of either 1) model 
2) model 

d(J(^^)) Order of the error system 

The procedure is described for the case of infinite time solutions 
of the optimal control problem. There are special considerations to 
be made, such as imposing an artificial (large) finite value for 
terminal time, when existence of an infinite integral is questioned, 
and these are cases appropriately noted, 

ASSUMPTIONS : 

1. controllability of (A^, B^) 

2. observability of (D,P), (A^, C^) , (A^, M^) 

3. Q, R both positive definite 

STEP I; Determine (Reduce from or write directly) 



T 

- r"V K (5.9) 
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STEP III; Solve for T, the characteristic time of 


STEP V: 


STEP VI I 


determine t 


from 


T “ X 


max 


r 3-13^ 3^ 3 I 

[KB-^R K + C QC J K 


(5.10) 


31. 


STEP IV; Assume a characteristic shape for e (t) 


31 

determine e (t) 


from 


31 3 1 

a) compute precisely that e (t,x^,x^) corresponding 


to 


or 


31. 


b) assume some shapes expected in e (t) (damped sine- 
waves, polynomials, etc.) 

Select orthogonal approximating functions f^(t) (i.e., 
Chebyshev, Legendre, Fourier, etc.) 


Specify D 


from 


f = D f 

For Chebyshev error systems 


D(T,d) A ^ 


0 0 0 0 
10 0 0 
0 4 0 0 

3 0 6 0 


(5.11) 


*_J dxd 

Solve for the Chebyshev-Fourier coefficients, P. 


determine F(T,d) 


from 
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■/.: 


g(cr) e^^(a) f^(a)da 


(5.12) 


where 


A-" = 


T ' 

-1 

9 

"l/2 o' 

1 g(a)f(a)f (a)da 

7T 


*'-1 

° ^d-1 


a A 2 Y 


STEP VII: 


- 1 


f.(a) = T.(a) = {a) = cos(± cos ^a) 

X X X 

Solve for error system feedback gains 


-1 3 

Determine = - R B L 


from 


[kb\“^B^ - ] L - LD = KP (5.13) 

Selection of estimator gains may be based upon either of two options 
(Villa, IXa) or (Vlllb) 

(STEP Villa) ; Specify measurement errors, and relative model 

21 

errors, e , (which are to be neglected in designing 
the structure of the estimator but which are to be 
considered in the selection of estimator gains.) 


Specify z^(t) , *6^^ (t) 


from either 


a) deterministic: z^(t) , 'S^^(t) 


-1 T 


b) random: E[z^] , E(2^- “z^) (z^- 'z^) 


E[e^^] , E(e^^- ^^) (?^^- e^^)^ 


In the infinite time problem the optimality of G has not been 

3 3 3 

guaranteed but the existence of L has, provided (A ,B ,C ) is a 
controllable, observable model. 
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(STEP IXa) ; Solve for estimator gains 


determine T 


21 


from 


min 
T 


1 T 

i tr C^QC 


Jo (2+='-")^'= 


21 

where obeys 

and X is defined by 

x(t) = r $(t,a)G^W^da 

•^0 


. E(0) = e[x^x^]= 


(5.15) 


where $(t,a) satisfies 
6(t,a) - A 'I>(t,a) 
and 


$(a,a) = I 


A^ + B^GT^ 

B^GT2 

[a2i+ B^GT^lM^ 

^22 ®2^"^2 


A = 


where and are defined by (4.102) and (4.103). 


Ti = 


T = 
2 


^ ~ ^12 -21] 


r”^ I I - r 

ill IX 


21 


_ r r T 
11 12 22 


'22 


(5.16) 


(TAKE ^ 22 = I without loss of 
generality) 


[r^i. A 
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STEP Vlllb: 


Solve for estimator gains 


determine T 


21 


from 




where 




^2 ^ [“q^ ■ \] 


W p .1 



nj njd 



0 , D , 


I 

[dns d J 


q 


A - 

^3 

p . 

njd 


_r“^ r 
11 12 

^ Zq 


L ^^3 

“d 


1 

q 



r,„ 1 A 


^n_ A n_ + d\ 



2=3 \ 

yq 4 "2 ■ y 


any positive definite matrices. 

STEP IX: 

Specify d 


1 

from 


V .j. 


I Q dt < v° 


(5.17) 


(5.18) 


(5.19) 


(5.20) 


(Iterate for d=0,l,2,... ) 


Several facts are summarized here concerning the options for 
estimator gain selection, T 21 * further discussion In sections 

4,5, 4.6). 


The subscripts on the matrices denote dimensions a x b, unless 

a = b. In which case only one subscript Is used. 
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Consider the design of the estimator for model ^2 (the order, n, 
of the estimator is then n 2 ~ ^ £ n£ n 2 ) . It is helpful to keep in 
mind that the Luenberger observer has no natural "band limiting" 
feature which is Imposed upon gain selection, while the Kalman filter 
has no natui^^I lower limit on the gains. 

1. If the order of the estimator is n 2 , then it is possible to 
select the gains so that the estimator becomes the steady state Kalman 

«w21 

filter. (Implying that the model error "e in (4.132) and the measure- 
ment residual error 'z (see (4.29)) may both be considered to be white 
noise even though they are neglected altogether in the deterministic 
point of view which leads to the same estimator) . 

2. If the order of the estimator is n 2 - as outlined in section 
4.5, then the gains may be selected in STEP Vlllb so that the esti- 
mator still becomes a limiting form of the Kalman filter corresponding 

/ 

to the case of vanishing measurement noise. (In this way the selection 

, 

of T_^ is influenced by white noise errors e but cannot be influenced 
21 ^ 

by the measurement residual errors , 'z ) . 

3. If the order of the estimator is n 2 ~ as in section 4.5, 

and the estimator gains are selected as in STEPS Villa, IXa, then the 

^21 

presence of random errors in "e and the presence of random errors in 
z both influence the selection of gains, in addition to any determin- 
istic effects presumed present . 

The following section illustrates these steps of the design pro- 
cedure for a practical problem. 
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Figure 6-1. Large Space Telescupe. 




exchange controllers called reaction wheels . Modal analyses of the 
flexible LST structure have identified structural modes from .1 hz 
upward. There are a number of problems in strategy that immediately 
beset the control designer: He must develop a model of the physical 

system including "significant” dynamical effects. He must design the 
controller to actively control "significant" modes of the total 
dynamical system (including internal and external disturbing 
effects) . The interdependence of the modeling problem and control 
problem is evident since one cannot usually know what dynamical modes 
of the system must be actively controlled (are "significant") prior to 
establishing a model. The model which is appropriate depends, in 
turn, on what disturbing effects must be controlled. In practice 
several iterations of models and control laws are usually tried. One 
may avoid such iteration by working with the most general, or "con- 
servative," model first (which is credible at "all" frequencies) . In 
practice, however, such a model might be extremely costly (in time and 
resources), if not impossible, to construct. Moreover, if it were 
used for control design, a complex controller may result which also 
feeds back insignificant variables. Thus, the economic motivation of 
the Minimal Controller Problem of section 2.0 comes to the fore: 

Design the least complex Controller that will satisfy the performance 
requirements. From this motivation we reduce our large model of 
LST, derived in Appendix C, to a very simple model^^. We augment to 
an "error system" to partially compensate for the modeling errors 
associated with this drastically reduced model. 
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The "line of sight," or the "optical axis" of the telescope can 
be displaced due to the effects of theirmally induced displacements 
of the solar panels. This "disturbance" turns out to be significant 
and the effects are approximated in the derivation of in Appendix 
C. The approximations of gravity gradient, magnetic, and aerodynamic 
disturbances are based upon a rigid spacecraft and are also summarized 
in Appendix C. 

6.1 Discussion of The Model 

In Appendix C a linearized model for LST is developed considering 
the effects of spacecraft flexibility, thermally induced appendage 
motions, gravity gradient, and aerodynamics. The model given by (C-77) 




( 6 . 1 ) 
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where empty partitions in matrices imply ^ero entries. See Appendix C 
for parameter definitions. Here, x^ is a (24 + 2v) -vector where V is 
the number of flexible appendage modes retained in the model 
(\)= dim n) . Of the 24 other variables, 4 are associated with rate 
gyros j 2 with position sensors j 2 with rate signal filtering, 2 with 
position signal filtering, 8 with reaction wheel control and 6 with 
the rigid body dynamics. Figure C-3 illustrates in block diagram 
form the general model* Now according to the STEPS outlined in 
section 5.2 we begin our OFFMEE design tasks. 

6.2 Development of the model (STEP I) 

The pursuit of formal procedures for reducing the model from 
to shown as Step I in Figure 4-2 is reserved for further research. 

One attractive method for accomplishing such a reduction, given the 
desired order of presented in Reference 11 but the method 

requires manipulations of which are costly owing to its large 
dimension. We will proceed to select a simple model for dii^ectly 
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which is based upon ad hoc procedures in order to illustrate the 
procedures of sections 4.0 and 5.0. 

— ^ * 

It was argued in Appendix C that the cross product terms, (S h) 6, 
between controller momentum and vehicle rate were small, as were the 
non-secular terms from external disturbances, G(t)0. Equation (C-74) 
then becomes 




( 6 . 2 ) 


where the additional definition has been substituted, SK i A T , 

m = c 

where T is taken. as an approximation of the control torque. Noting 
c 

/N— 2. 

the definition of J in (C-75) this model becomes 




_i * X T 

where F ^ [J - F ] . (The F of this section should not be 

confused with F of earlier sections. See Appendix C for definitions). 
* 

Now if we choose to truncate all appendage modes from the model, we 
must write, in the spirit of section 4.0, 


Generally we might choose to keep some appendage modes within the 
design model and truncate others, depending among other things upon 
the uncertainty in the modal data. 
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(6.4) 


e = T + e(t) 
c 

31 

where the model error vector e evolves from the error system 
(regerding (6.2) as model for our present purposes), 

e = f“^ r$(25a f) + n) + ^"^(^0“ Im'L) 

^ [I-f-$VF"^r$][2Ca fi + a\] - 

- $'^r^F"^ T + [I+$VF"^r$]$^ (6.5) 

o 

If we further simplify the model to a single axis problem (principal 
coordinates assumed) about axis then 

9„ = {f“^ T } + e* (6.6) 

2 c 2 2 

error e„ = {F ^r4>(2?a li + il)}o+ (f ^(T - ^ A)}_ 

z 2 o ^ 

system fi = - $^r’^F~^(T + T ) + [I + $^r’^F"^r$l$'^ ^A 

o c 

An alternate approach of approximating the truncated effects is 
to identify a small parameter which multiplies the coupling terms on 
the left hand side of Equation (C-73) and apply singular perturbation 

g 

theory in the solution of the control problem. The methods of may 
be applied in this event as discussed in section 3.3. Such an approach, 
however, seeks to implement a control law which feeds back all the 
states of the higher dimension of (C-73) and is an approximate solution 
to a higher order problem. We wish instead to formulate a version of 
the problem that is less exact (than (C-73) represents) and then solve 
this approximate problem exactly. Toward this end we identify some 

For convenience j we drop the 31 superscript on the error vector. 
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properties of truncated effects and implement an approximation which 
evolves from an "error system" which is of smaller order than the 
truncated modes, n in (C-73) . 

Returning to our immediate task of constructing a simplified 
model, noted in the discussion of section 4.0 that the 

decision of how many modes to assign to the physical variables in 
and how many modes to assume in the error system was not a straight- 
forward decision to make even for fixed dimension of the (n^+d) -vector . 
(n^ is the order of order of the error system) . 

There is, for instance, the decision of whether to incorporate a given 

3 

appendage mode into the model and hence into the vector x , or 

3 

whether to delete (truncate) the mode from (x ) and incorporate 

its effect in the "synthetic modes" associated with y. The first 
difference to appreciate is that certain parameter uncertainties may 
be "forgiven" if the mode is accommodated within the synthetic modes 
of the error system as discussed in section 4.0. If all the param- 
eters associated with the mode in question are quite well known and 
if the mode is known to have a significant effect upon the output 
then it may be desirable to leave the mode in Usually in space- 

craft structures the modal data is somewhat in question. The fre- 
quencies of the assumed modes are often uncertain because of the 
approximations made in characterizing the model. For example, the 
frequencies of the forced appendage are not the same as those obtained 
by solving the eigenvalue problem for the homogeneous appendage equa- 
tion as is often done. The structural damping of spacecraft, is 
another parameter which is difficult to predict reliably. A typical 
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pattern is to select a number for, tlie modal damping (after 
transformation to distributed coordinates and truncation is accom- 
plished) , in (6.2), and if possible perform some ground vibration 
tests. In such tests it is difficult and sometimes Impossible to 
provide the boundary conditions indicative of orbital (free) flight, 
and the ground vibration data is suspect. In LST the task is further 
complicated by the unprecedented pointing requirements, and test data 
for damping with such small displacements will either be impractical 
to obtain or unreliable (or both) . For these various reasons our 
first choice of an LST model jti.^ excludes all flexible modes so that 
they can be approximated by the synthetic modes of the error system. 
Also we wish to begin by considering the least complex controller 
(hence smallest model). Thus, the model taken to be, from 

(6.6) neglecting e. 




3 .3 3 ^ _3 o 

X = A X + B u 

3 3 3 

y = c X 

3 >.3 3 

z =* M X 


( 6 . 8 ) 


where the pitch axis is selected for study in this planar problem, 
and where 
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6.3 Solution of The Error Free Problem (STEP II) 

The solution of the linear regulator problem for (6.8) follows. 

lOO T 

o o * 

min I (y Q y + u° R u°)dt 


o •'0 


subject to 


.3 

,3 


X 

= A 

X 

3 

3 


y 

= c 

X 

3 



z 

= M 

X 

x^(o) 

= X 


o 



■ [: i] ■■’■(:) 


= I, 


3 3 3 3 

yields (since (A , c ) is observable, (A ,B ) is controllable) 
where K obtains from 

T T 

33 3-13 3 3 

0 = - K A - A K + K B R B K - c Q C 


(6.9) 


The resulting K is 


K = 


fqiCqa"*" ^7^ ^ 


L 




where Q and R have been selected, in form, 
R = 1 


Q = 


t 


“ 

0 q. 


Noting that the closed loop system for (6.9), (6.10) is 

.3 -3 3 

X = A X 


( 6 . 10 ) 


( 6 . 11 ) 


( 6 . 12 ) 


(6.13) 

(6.14) 


where 
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0 


1 




-/I, 




(6.15) 


We can relate the parameters to the classical parameters for a 

—3 

second order system, co , by writing A in the form 

n 


a3 = 


- - 25 u 

n n 


and equating (6,15) to (6.16). The result is 
4 

12 = K 


(6.16) 


(6.17) 


Thus, if we wish, in this case, to choose the matrix Q on the basis 

of the classical parameters C and O) , we may do so since, from (6.14) 

n 

(6.17) 


Q = 


0) 

n 

0 2o) 


1)J 


(6.18) 


Note further that the requirement for Q to be positive definite 
implies, from (6.18), 

C > ,707 (6.19) 
for optimal solutions to (6.9). It is also helpful to rewrite (6.12) 
using (6.17) 


K = 


2K w 
2 

L n 


n 


00 

n 

2C 00 


nj 


( 6 . 20 ) 


Finally, the last required calculation of STEP II is 
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( 6 . 21 ) 


G, A - R-^ K = (- 03^ , -2K W ) 
i. — n n 

Rather than specify in (6.21) the free parameters (and 

thereby place the poles of immediately we will (since the model 

is so simple) carry them in parametric form for a while. 

In section 4.0 it was mentioned that deterministic designs by 

optimization have no natural band limiting quality. It is interesting 

to note in this simple example that the optimization solution (6.15) 

3 

allows pole placement anywhere in the region R of the complex plane 



Quite obviously, however, the fidelity of ■ the rigid body model 
is questionable beyond the spectrum of truncated modes. That is, 
from Appendix C, the eigenvalue magnitude of the truncated rate 
gyro modes, (from (C-31)), 
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IX I = 0) = 2tt 16 
rg' r 


and the eigenvalue magnitude of the neglected position sensor mode 

(from (C-33)), 

lx I = 2 tt 
' gs' 

and the eigenvalue of the truncated A/D prefllter mode (from (C-35)), 

|X i = 27T 20 
pi 

and the eigenvalue of the first appendage mode (estimated) 

I^FRUSaI “ 2it(.1) 

all establish bounds on the spectrum of credibility, C(^^), of the 
model Figure 6-2a. 

The Kalman filter design for model provides further insight 

into the effects of modeling errors. We will indulge in a brief 

digression now to see where the Kalman filter for the estimation of 
3 

X places the estimator poles relative to the above spectrum of 
credibility for model dy Using the d^ parameters in (6.9) and the 
Kalman filter equations of Figure 2-2, it can be shown that the steady 
state (i.e., = 0 in Figure 2-2) Kalman filter, 

X = dx + B u +Kz 


3 "^3 

d A [A - K M ] 




has the eigenvalues 


X = tr d 


where 


1 + , /I - 


4/d/ 


(tr d)' 


(b) 
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“ - (^J 


1/4 


(qrj^- Zx\) - 1t\ 


11/2 


dl A det d = - 


(-) 


(VyqT^) 

1/2 


(c) 


where 


= variance of the position sensor noise 
r^ = variance of the rate gyro noise 
q = variance of the disturbance torque noise 


R A 


0 r. 


* Q A q 


Note that the estimator poles leave the model credibility spectrum 
as q ->■ 0 and again as A(qr) > X (see Figure 6-2a) . Under 
these conditions the Kalman filter might diverge, because it is making 
predictions over time intervals (too long or too short) for which the 
model is not reliable. To see this more clearly, consider that the 


right boundary of corresponds to time constants ^ 


= T 

x=o 


Now since the filter relies absolutely upon ("believes") the model it 

* 

has built within, it places the estimator poles at the origin (corre- 
sponding to q/^ 2 ^) 0 in (c)) . This action serves to disconnect the 

measurements from the filter (that is K = Q in Figure 2-2) since the 
filter feels justified in making predictions over arbitrarily long 
intervals, T Now when the estimator places its poles at (or too 


* 

Recall that these events unfold in time according to the differential 
equation for K in Figure 2-2, evolving finally to the steady state 
values described above. 
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near) the origin, (to the right of C(jd^)) it can diverge because of 

predictions over longer intervals than that for which the model is 

reliable. When the estimator places its poles to the left of 

the estimator may diverge again due to predictions over intervals 

Inappropriate for the model (but in this event the intervals, T , are 

s 

too short, 1/X = T^, A ^) • In view of this problem we might 
naturally ask the following questions. 

1. Can this estimator be stabilized by better statistical data 
for the noise? Not necessarily. Assume that q, r^, r^ actually 
represent the properties of the physical system. It might still 
happen that the pole placement specified by (a) , (b) , falls outside 

(note that information concerning the boundaries of C(^3> is 
not contained in (a)). 

2. Can this estimator be stabilized by better plant parameters? 

Not necessarily. Just as with the statistical parameters discussed 

3 3 3 

above, there may not exist any plant parameters (A , M , B ) which 
will stabilize the divergent Kalman filter (see Reference 22 for 
further examples of this circumstance) . 

3. Can this estimator be stabilized by the adaptive Kalman filter? 
Again, not necessarily. As discussed in section 3.0 most adaptive 
filtering schemes increase q in some way. Note that increasing q may 
help if the estimator poles are initially near the origin. However, 

if ^2 correspond to poles near the left boundary of the 

adaptive Kalman filter will actually cause the state estimates to get 
worse. 
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The conclusion dravm from this digression with the stochastic 
problem solution for is that poor performance from a Kalman filter 
does not imply a poor choice of parameters for either the noise covar- 
iances or for the plant. The Kalman filter ignores the presence of 
boundaries of . The adaptive Kalman filter shares a fault with 

deterministic estimators of Luenberger that the presence of a left 
boundary of ) for this example, would be ignored. From this 

example we may conclude also that the deterministic (Luenberger) esti- 
mator does not have to be "faster" than the control system, as is 
typically suggested.^^*^^ This follows from the result which shows 
the equivalence between the steady state Kalman estimator and the 
Luenberger estimator. From the note that the Kalman filter poles 
might fall to the left or to the right of the poles of the closed loop 
control system depending upon the selection of in (6.18) and the 

(independent) parameter values q, r^, r^ in (c) . 

6,4 The Characteristic Time of ^^^(STEP III) 

From the derivation of characteristic time in (4.78), 


= A l[K + 

max 


T -1 

r Q c-^] 


K} 


( 6 . 22 ) 


= A 

max 


{K> 


where from (6.9), (6.13), (6.18), and (6.20), the eigenvalues of K are 


A{K) = 



2cj^(3?^- 1) 


^ + 


2 

2C - 1 

\/4c2-1 


* 

The more likely case is that the means of the random variables w, V 
are not zero (nor or they necessarily constant) . The method to model 
such correlated disturbances is to use the structure of the error 
system proposed in this study, specifically in section 5.0. 
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Then 


max 


{K} = 


2 

4C - 1 


203 (3C - 1) 

n 


Z + 


ir}- 1 
/kxp-- 1 


(6.23) 


Note that the eigenvalues of the closed loop system are 


X[A' 


^] = - ± 1 ] 


31. 


6.5 Characterization of The Error Vector e (t) (STEP IV) 

In the absence of other information one may assume a "mode shape" 
31 

for e (t) , as one alternative discussed in section 4.0. We choose, 

31 

however, to calculate directly that e which is defined in (6.6), 

31 

(6.7) (actually, e A e^ in (6.6)). From the approximate data given 
in Table 6-1, (6.7) yields 

e (t) = .3e + .3e cos .62St 

+ .007 sin .00111 t - .08 sin .00222 t + .007 sin .00333t 
+ .007 sin .00444t (6.24) 

where we have used the calculation of the external disturbances, T^, 
in Newton meters, 

T = - .034 sin .00222 t + .02 cos .00222 t 
o 


.003 sin .00111 t + .003 sin .00222 t + 

.003 sin .00333 t + .003 sin .00444 t 
which is an approximation of (C-69) in Appendix C. 


(6.25) 
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Table 6-1 




LST PARAMETERS 

ASSUMED FOR SOLAR PANEL MODE 

r$ 

a: 

15.8 



= 

.01 

modal damping 

a 

= 

ZttC.I) rad/sec 

solar panel mode natural frequency 

“l 

= 

10.0 Kg 

modal mass 

F 

= 

- 46000 Kg m 

2 . . j 

axxs xnertia 

X 

= 

.05 1/sec 

thermal time constant 

A 


1.0 m 

steady state thermal deflection 

o 




K 

o 

= 

.00111 rad/s 

orbital rate 

m 

= 

8,000 Kg 

vehicle mass 


6.6 Selecting The Synthetic Modes (STEP V) 

Here we choose the Chebyshev error system 

-31 „ 

e = Py 


Y = DY 


where 


0 0 0 0 
10 0 0 
0 4 0 0 

3 0 6 0 


•. dd 


and P is defined next. 
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6.7 The Chebyshev-Fourier Coefficients, P(x,d) (STEP VI) 


From (5.12) 


= g(a) 

•'-1 


31 T -1 

e (a) T (a)da A ^ 


(6.27) 


A 


-1 


2 

7T 


1/2 0 

0 I 


d-1 


Using the Chebyshev quadrature formula in (A-17) for evaluating (6.27) 

31 

we obtain, using the data in Table 6-1, and e (t) given by (6.24), 

"b 0 0 0 


P = 


Pi P2 


(6.28) 


2d 


where 


= .645 
P2 = -.155 

Pj - .62 

P 4 “ -05 


where we have now selected for w in (6.17) 

n 

C = 1.0 


to = .4 

n 


(6.29) 


6.8 Feedback Gains for the Synthetic Variables of the Error System 
(STEP VII) 

Now we solve for L from (5.13). 


3 -13 
K B R B 


L - LD = KP 


(6.30) 


The recursive solution to (6.30) given by the lemma of section 5.1 

T 

yields (define S^ A b\"^B^ ) 
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|ks^- J k 
[ks^- l" 

[kS^-A^^] 

[kS^- ^kp'»-\ l'^ 


°d(d-3)) 


[kS^- A^’'] ^KP>“-V l‘*'^D(^_ 3) ^■'^■^(d.X) (d-4>) 


Now since 


3 3 3 

K-^S-^- A-^ 


T-1 W 

C n 

1 


3 3 3 3 

K S - A K = 


(4^^-l)oo^ 0 

n 


2C 0) 1 

n 


then the columns of L become, noting the values of in (5.11), 
for d = 1 , (denote L by L^) 


l 1 l^= r 

1 iPi 


For d = 2 , (denote L by L^) 


148 



(6.35) 



(6.36) 


(6.37) 


Thus, as d is increased, the old values of L are retained and a 
correction term is added. 

For the matrix G we must compute 


Hence , 


r -1 3^ 3 -1 3^ 1 

G = - R B K , - R B Lj = 

from the data (6.1), (6.3) and 



(6.38) 



T 

K 


3 




(6.39) 


-1 3 

- R B L = 



(6.40) 
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From (6.12), (6.14), (6.15) , r(6.13) can be written, for d £ 3, 


G = l-to , -2^ w , - P 21 + 2 2 ^23 ’ 

X 0) 

n 


(6.41) 


- P 


22 ’ 


- P 


23 ’ * 


and 


G^= -P^+ 


16 

22 P3 » “ P2 » " P3 ’ * 


X w 


n 


(6.42) 


where p. = 0 for i > d. 

6.9 Estimator Gain Determination (STEP VIII) 

In order to gain the insight available from an analytical compu- 
tation of the estimator gains, choose d = 2, a tractable case. Now 
first we will follow a simplified version of STEPS Villa, IXa, listed 
in section 5.2. 

To actually get a meaningful evaluation of the controller design 
a more substantial "evaluation” model should be used than the model 
used here. We will here simply show a choice of estimator gains 
which makes (5.16) a stability matrix when j=3. The alternate 
STEP Vlllb may also be selected for the task of stabilizing the 
estimator (with perhaps even less effort) . 


Now for the matrix A in (5.16), using the model given in 


(6.9) 


A = 


A^+ B^(G^+ G^ T^^) I G. 


^'^21” ^21 ^ 


D 


(6.43) 


Define the parameters within T 


21 
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X = 
21 


0 t, 


0 t 


(6.44) 


where the zero entries result from the fact that the matrix P has a 
zero first row (hence the eigenvalues of the estimator , 

^^^22^ = , are independent of the first column of . 

The closed loop system is now described by two physical variables 
^ 2 ’ ^ 2 ’ synthetic variables of the error system state esti- 

mates y^. This system is written, from (4.115), using (6.42)-(6.44) , 


0 


1 

2 

-w 

n 



Y 

0) 

n 

*^2 

2C^n 

2 

0) 

n 

‘4 


0 


0 

-(/ 

n 


p^t^- 

2 

0) 

n 

'^2 

n z 

2 

CO 

n 

■^4 

2C(iO t,+ - 1 

n 4 T 


i 0 

T 


(6.45) 


where 6 , 9 represents, as did z" in (4.115), the measurement residual 
errors (the difference between the actual measurements and those pre- 
dicted by the model) . As noted in prior discussions these residuals 
may be characterized as noise in the estimator gain evaluation, if 
desired. The transfer function between the position 62 ( 5 ) and the 
model error, R(s) , is, from (6.45), 
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and, consistent with our expectations, there is no upper limit for 
(based upon these stability arguments) . 

Now the controller which must be implemented according to this 
design is illustrated in Figure 6-2 in block diagram form and given 
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below in equation form, from (4.111), 


/A 



\ 



0 0 
2/t 0 



0) ^ t„ 
n 2 

2 ^ 

0) t, 
n 4 


2C 0) t_ 
n / 

2? 0) t,+ - t„ 

n 4 T 2 



o 


u 




-25 tjP,- 



(6.50) 


The state estimator, (4.91) (and the lower part of (6.45)) and the 
closed loop evaluation model, (6.45) (from (4.115)), are both stabilized 
by the choices 


0) 


n 


t„ = 


t, = 


1.0 1 

.4 

-4.6 

-25.6 


T = 3.0 


2 

It is further noted from (6.46) that due to the s term in the 

numerator the vehicle pointing position will be insensitive to any 

model errors, e(t), which are closely approximated (within time 

constants of (6.46)) by a constant or a ramp in time. Now, decreasing 

(o3n w^) will give better noise rejection, but for the larger time 

constants associated with smaller oo w , fewer model errors can be 

n e 

approximated by a constant and a ramp. Consideration of sensor noise 
data, when available for substitution into (6.45) (for will 

permit perhaps better choices for 03^, t^* t^ from the multi-model 
procedure described above. The competition between the "low frequ/.;ucy" 
and the "high frequency" effects of modeling errors indicates that a 
"best" set of design parameters exists. Generally, however, neither 
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the Kalman filter nor the Luenberger observer designs provide these 
best parameters, owing to their commitment to a fixed model whose 
resulting controller performance is eventually limited by the errors 
of the model (such as truncated modes) . 
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7.0 Concluding Remarks 


This research has been concerned with that deficiency of modern 
control theory which causes it to fail to recognize limits to the 
fidelity of the mathematical model. A modeling plan is proposed for 
model construction which allows a parameter of the control problem 
(called the characteristic time, t) to influence the modeling problem . 
This is accomplished through the introduction of an error system which 
is augmented to the model of the system and which is designed to 
approximate the effects of model errors (external disturbances, trun- 
cated modes and parameter errors) only over the Interval, t. The error 
system chosen belongs to a class called the Sturm Llouville equations 
which are known to generate orthogonal functions. The benefit such 
an error system provides to the closed loop linear regulator controller 
(using linear state estimation for feedback of the states of the model 
and the states of the error system) is that when measurements from the 
physical system differ from those predicted by the model (built within 
the state estimator) the difference is automatically fitted with a set 
of orthogonal functions (for a least squares fit over the interval t) 
and the corrected signal is used for control. The estimator for the 
error system is therefore called an "orthogonal filter" and is used 
for control by model error estimation. 

1—6 

This report lends further support to prior claims that 
increasing the order of the model can provide better systems perform- 
ance. When the "model error vector" described above is considered to 
result only from external disturbances, then these procedures can be 
used to design "disturbance accommodating controllers," discussed in 
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Reference 4 when the "model error vector" is considered simply as white 
noise the Kalman filter can be obtained. The method has a computa- 
tional requirement similar to that of the singular perturbation 
8 

approach, in that a Riccati equation need be solved only in the 
smaller dimension of the "error free" system model (corresponding to 
the "truncated" model of Reference 8) . 

An example illustrates the application uf orthogonal filters in 
the special case when the orthogonal approximating functions are 
sc^iected to be Chebyshev polynomials. The Chebyshev error system 
that results is then applied to the control of the Large Space Tele- 
scope (LST) which the National Aeronautics and Space Administration 
(NASA) plans to use for extragalactic research in the 1980's. 
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8.0 Future Research 


The following areas seem fruitful for further research. 

31 

1. Model Learning Observers; The model error vector, e (t) , 

was defined to be that vector which makes the output of the smaller 

model equal to the output of a larger, more credible model. The 

31 31 

approximations used for e (t) , which were called e (t) , do not in 

fact render the two outputs exactly equal. The measurement residual 

which results could be further approximated ty considering the model 

error vector to influence the measurements directly through some matrix 

M 

Y 

2^ = x^ + Y , (M^ Y * *2^) 

which fliatrix, M^, could be adaptively regulated to decrease the meas- 

0 2 

urement residual, z (t) - z'(t). Such a procedure, if found, would 
make the estimator "model learning" in the sense that better knowledge 
would be continuously obtained about how the model error vector 
actually influences the measurements. 

2. In the Tri-Model design approach no simple procedure has been 
established for the solution of the estimator gains, 3^21’ 


min J(T 21 ^ 


min 

T 

21 


/ 

Jc\ 


Q y dt 


when the output y evolves from a different model, than that used 

for the estimator design, -^ 2 ’ T “ difficulties can arise in 

evaluating J, > (see above) when the model error system is unstable. 
'' 21 '^ 

(Recall according to an evaluation of the closed loop system with 
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Model ^ 2 * eigenvalues are equal to those of the error system in 
addition to those of the optimally controlled error free system) . 

From a practical engineering point of view such solutions might well 
exist, and further work is needed to provide practical approximations 
for T 
mathematically. 

3. Further reductions in the order of the controller can some- 

times be realized by estimating the linear functional Gx instead of 
2 

first estimating x and then multiplying by G. Possible reductions in 
controller order should be investigated for the special matrix struc- 
ture of the orthogonal filters. 

4 . Other practical problems should be solved using Chebyshev 
and Fourier error systems to determine some guidelines to indicate 
which o::thogonal functions one should select. The limitations of the 
error system approach should be investigated. It seems likely that 
stability considerations will eventually limit the order of the error 
system (and equivalently the degree of the orthogonal functions) which 
can be augmented to the system model. 


21 


even though J 


''T ) = 

' 21 -' 


min 


f 

•'0 


y Qy dt might not exist 
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APPENDIX A 


LEAST SQUARES FITTING OF DATA FOR MODEL ERROR 


APPROXIMATION WITH JACOBI POLYNOMIALS 
In this appendix we assume that the model error vector, e(t) , is 
a given function of time and we wish to approximate e(t) by e(t) where 

e = Pf f^ = (f^,...,f^) 


and f^(t) are Jacobi polynomials. This class of polynomials is one of 
interest because as special cases within this class certain other 
desirable polynomials are obtained, such as the Fourier series, the 
Legendre and Chebyshev polynomials. Before specializing to these 
cases, however, we will first develop a model error system 

e = Py 


Y = Dy 


(A.l) 


which is capable of generating the more general Jacobi polynomials. 

It will be convenient in this Appendix to normalize time to vary over 
the interval [-1, 1] instead of the interval [0, i] . Such a change of 
variable is accomplished by the definition 


a A 2 - - 1 

= T 

The matrix P which minimizes 


tc[0, T] 
aC[-l,l] 


(A. 2) 



- e(a))^ q(a) 


(e(a) - e(a))da 


(A. 3) 


subject to 

e = Pf 

e(a), f(a) g(a) specified o’C[“l>l] 
is given by 

REPRODUCIBiLrrY OF TiiF 
ORIGINAL PAGE IS POCP? 
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where 



Then from (A* 5), (A. 6) , 
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Proof; 


fci 

The Jacobi polynomials f^(o) * P^'" are orthogonal on the 

ct S 

interval crC[“l»l] with respect to the weight g(a) = (1-a) (1+a)^. 


This implies 

(ct) g(cJ)da = 0 
and from Reference 4, p. 774 


'i' 


i ^ j 


4:2, . , , . _ rci+g+l) r(i4-g+l) I 

^i^ (2i+a+3+l)i! * r(i+a+3+l) L 

The Jacobi polynomials may be expressed by the Rodriques formula 
p. 58, 


P, 

1 


(ot,3)(^) ^ Izili (l-a)"®^ (1+0)"^ ^ 

2 i! da 


( 1 - 0 ) “■^^( 1 + 0 )^'^^ 


L 


Theorem A-3 

The matrix P which minimizes (A. 3) subject to 
e - Pf 

(ct 

where f^(0) = P^'“ *^"^(0) are Jacobi polynomials, is given approxi- 
mately by 

fy kf_^ s(a) fj(a)da 4 


(A. 8) 


where 0^^ is defined by 


p (a*3>(a ) = 0 
n k 


and the coefficients 
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. = r(ttt«+i) r(n+e+i) 

nl r(n+c«.B+l)a-0^)[^P<“’®>(a^)]" 

The positive numbers aj^ are called the Christoff el numbers. For a 
proof of the Gauss-Jacobi quadrature formula, (A. 8), see Reference 64, 

p. 112. 

Now that P and f are defined, the "open loop" model of the error 
system, as discussed in section 4.2.1, is determined by 

e = Pf (A. 10) 

However, to generate e from the "closed loop" model of the error 
system (see section 4.1.2) we must define a set of differential 
equations the orthogonal functions f^(a) are known to obey rather 
than accept the "open loop" description of e as given by (A. 10). 

The Jacobi polynomials are eigenfunctions of the Sturm-Liouville 
equation (taken from p. 311 of Reference 64, Equation (7.8.4)), 

(1-a^) p^(0) [g-a-(a+e+ 2)a] p^(a) + i(i+a+3+l) p^(a) = 0 (A.ll) 
(i = 0,1 d-1) 

However, (A.ll) is a time varying system and if placed in the first 
order from 

'i. = [P,0]y(a) (A. 12) 

fv(cs) 

Y’ = D(a)Y(a) Y 4 

\p’(a) 

then D is a 2d x 2d time varying matrix which complicates our intended 
procedure for real time state estimation of y. 
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Chebyshev Error Systems 


As a special case of the Jacobi polynomials, those 

obtained when a = $ = - 1/2 are called Chebyshev polynomials (multi- 
plied by a constant, c^) . 

Theorem 4.4 

The Chebyshev polynomials, f^(a) = T^(a) = — P.(-l/2, - 1/2) (a) 

1 

yield for the g(a) in (A. 3), (A. 4) 

g(a) = (1 - (A. 13) 

and 


r ^ 

'-1 

g da = A 


• 

A-f 

‘2 o' 

0 1 

. A-" = ^ 

IT 

1/2 0 
0 , 


d-1 

d-1 


(A. 14) 


where 


^ r(i + 1 / 2 ) 

i! 


(A. 15) 


The proof follows from substitution of a = 3 = - 1/2 in Theorem A. 2 
and from the normalization defined for the Chebyshev polynomials, 
resulting in the C^ in (A. 15), from (22.5.31) of Reference 65. 

Theorem A. 5 (from Chebyshev quadrature formula, Equation (25.4.38)) 
The matrix P which minimizes (A. 3) subject to 

e = Pf 

where 

f^(a) =» T^(a) 

are Chebyshev polynomials, is given approximately by 

Ijj^PRODUdBILriY in 

PAGE IS 
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(A. 16) 


''jj ’’ij ° Ii s(a)e^(0)fj(c»da - ^ 
where are points when 

a, = cos 
k 

Equation (A. 16) is the Chebyshev quadrature integration formula, 

(25.4.38) in Reference 65, but this result can also be obtained 

directly from (A. 8) and (A. 9), using a = 3 = “ 1/2, noting (A. 15) and 

Cj T . (a) = P . ^ (a) . The matrix elements A,, in (A. 16) are 

i i i 

defined by (A. 14). 

The Sturm-Liouville equations whose eigenfunctions are Chebyshev 
polynomials are given by (A. 11) with a = 3 = - 1/2 

(l-a^)p\'(a) - a p^(o) + i^ p^(a) = 0, i=0,l,...,d (A. 18) 

and the first order forms corresponding to (A. 12) are 

P'.(a) P.(a) +~~P. Ao) i-0,l,2,...,d (A.19) 

1 1-a ^ 1-a 

The model error system using (A.19) is written 



e = Py 


Y = Dy 


f = ^ 

da 


where P is given by (A. 16) and 



(A. 20) 


(A. 21) 


173 



Now (A. 21) has a time invariant representation which can be obtained 
by 

Theorem A. 6 

A time invariant model error system which can generate the 
Chebyshev polynomial least squares fit to the data e(a) is given by 


e = Py 

Y» = Dy 


(A. 22a) 


where P and are given by (A. 16) and (A. 17), respectively and 


D = 


0 

1 

0 

3 

0 

5 


0 

0 

4 

0 

8 


0 

0 

0 

6 

0 


0 10 


0 

0 

0 

0 

8 


0 

0 

0 

0 

0 


0 10 


0 

0 

0 

0 

0 

0 


0 12 


12 


0 12 


0 

0 

0 

0 

0 

0 

0 


(A.22b) 


L J"'"' 

Proof ; 

The only thing new to prove is that the dxd matrix D can be 
represented in the time invariant form of (A. 22b) instead of the time 
varying form of (A. 21). Observe that the Chebyshev polynomials can 
be written in the form 


k=»0 

and the inverse relationships 


(A. 23) 
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can. be taken from Table A-1. Differentiating (A. 23) with respect to a 
and then making the substitution (A. 24) into (A. 23) gives in a straight- 
forward manner 

T’ = DT A (T^,' T^,...,T^^^) , T’ A 


where D is given by (A. 22b) 

Note that when normalized time, O’, is replaced by t we have from 
(A. 2) Y = (2 /t)y’ and the matrix corresponding to "^ = Dy is obtained by 
multiplying (A. 22b) by 2 /t. 

Now we proceed to see how any solution of (A. 22a) relates to the 
Chebyshev polynomials, f^(a) = T^(a) . 

Theorem A. 7 

Any solution of (A. 22a) satisfies 


e = P(Y(o))T A^T^(a), T^ (a) , . . . , T^_^^ (a)j (A. 25) 

where the T^(cr) are Chebyshev polynomials of degree i and the coeffi- 
cients of T(a) are functions of the initial conditions of (A. 22a), 
specifically. 


P Y(0) A P 


Y^(0) 

Y^(0) Q 2 


(A. 26) 


y"^(o) 


where 
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Table A-1 (from Reference 5, p. 795) 
COEFFICIENTS OF THE CHEBYSHEV POLYNOMIALS AND THEIR INVERSE 



0 

1 


2 

3 


4 


5 

7 


a 

a 


a 

a 


a 


G 

a 

b. 

X 

1 

1 

2 

4 

8 

16 

32 

^0 

1 1 


1 


3 

j 

10 



1 

1 


3 


10 


T 

2 

-1 


2 1 


4 


15 



-3 1 

1 


4 

1 


5 



-1 

i 

1 

-8 

] 

■j 

1 ® 

1 


6 



5 ! 


-20 


16 1 


^^6 

-1 

i 

-18 


-48 


32 1 


Example: T,(a) = 32 - 48 + 18 - 1 

o 

^ 10 T + 15 T„ + 6 T, + T, 
32 o 2 4 6 











Proof: 

The state transition matrix for A in (A. 22b) may be computed 
exactly from the finite series 


$(a,a ) = d 8 + ^ fr + 

o a z 3: 


a A G - a 


Then, from (A. 22b), (A. 27) becomes 
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0 


0 


0 0 


0 0 


$(a,a) 


3a + 4 a 

a2 a4 

16a + 8a 


8a + 32a" 


0 0 


1 0 


8a 1 


By substituting a = a + 1 (taking a^ = -1) and using (A. 24) and Table 
A-1, $(a, - 1 ) can be written, after some labor j 


T +T- 
o 1 

3T +4T.+T„ 

O 1 z 

$ (a , -1) * 13T^+18T^+6T2+T^ 


4t +4t, 

o 1 

18T +24T^+6T^ 
o 12 


6T +6T, 
o 1 


59T +88T,+36T^+8T^+T, 88T +128T,+48T^+8T^ 36T +48T,+12T^ 8T +8T, 
o 1 234 o 1 23 o I 2o. 1 


$(a,-l) = 


(A. 28) 


where 
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= Q^T A 


<^2 = ^^ 2 ^ ^ 


^3 = V 4 



^4 = Q 4 T = 


(155 = Q 5 T = 


Now since 


13 

18 

6 

ll 

18 

24 

6 

0 0 

6 

6 

0 

0 

1 

0 

0 

0 



0 

0 


59 

88 

36 

8 

1 

88 

128 

48 

8 

0 

36 

48 

12 

0 

0 0 

8 

8 

0 

0 

0 

1 

0 

0 

0 

0 



0 


0 


y(a) => $(a,-l) y(0) = 


<^l y(o) 


>d ^<o> 


it follows from e = Py, (A.29) and (A. 28) that 
Ho) = P(y(0))T 


where 




p = p 


y''<0) 

« 

y'^(O) 


On the basis of Theorem A. 7 the closed loop "synthetic mode" 
description of the error system (discussed in section and 

described by (A. 22a), is seen to have certain advantages over open 
loop descriptions (a priori selection of e(a) as a prescribed function 
of 0) . Specifically when on-line state estimation of yiO) is accom- 
plished (in response to real measurements) then (A. 25) shows that the 
model error vector, e(a), is a least squares fit to the actual error 
vector e(0) using Chebyshev polynomials whose coefficients are appro- 
priately and automatically selected by the current state estimates 


for Y(a) . 

Fourier Error Systems 

Let the Chebyshev polynomials be written 

T^(a) = cos(n cos ^ a) 0C[“1>11 

and make the change of variable 

cos 0 = 0 9C [iT.O] 

then, 

F (9) = T (0) = cos n 0 
n n 

Furthermore make the change of variable 

9 = wt + t C [0 ,t] 

For consist ant limits 


(A. 30) 


(A. 31) 


(A. 32) 
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A time invariant characterization for the Fourier error system is 
possible by increasing the order of the error system to 2d. 
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APPENDIX B: EVALUATION OF THE COST WITH RESPECT TO MODEL 


In this appendix we wish to derive an expression for 
^T T 

J(T 2 ^,d) = E J y° (t) Q y°(t)dt 


(B.l) 


sub j ect to 


x = Ax + G w +G, w, 
r r d d 


(B.2) 


y = y + y = cx + y 


E[x ] = 0 = X 

o c 

E[x x^] = S 
0 0 o 

E[W ] = 0 = w 

r 1 


E[W^(t)w^(a) ] = 5(t-a) 


(5 is Dirac delta) 


E[x w (t)] = 0 
o r 

E[W^1 - 

T T 

E[w, w ] = Wj Wj 
d d d 


e[7] 


n ”1 
= 0 = y 


E[y(t) = Ry 6(t-cJ) 

E[y(t) w^(t)^] = 0 


E[5^(t) x^] 


= 0 


Define 


E _A [ (x-x) (x-x)"^] , X A E[x(t)] 


The fora of the solution for (B.2) is 


:(t) = $(t,t )x + r $(t,x)(G w + G.w,)do- (B.3) 

o o r r d d 
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Then 


$(t,a) G,w, da 
a cl 


x(t) = 

•'0 

Now the covariance matrix ^ becomes 


(B.4) 


2=E- 


$(t,t^)x^ + J $(t,a)G^w^daj|i(t,t^)x^+ J* $(t ,a)G^w^d<sj 


But since is assumed white, the screening propfsrty of the Dirac 
delta function present in the covariance kernel for w^(t), see under 
(B.2), allows us to write 

$(t,0)53 $(t,0) + f $(t, 0 )G Q G^’ $'^(t,a)da 

Jq 

Differentiating (B.5) with respect to time yields 

t = + + . E<o)=E„ 

An alternate expression for (B.l) is 


Jdj^.d) = E JT 


(cx + y)^ Q(cx + y) 


(B.5) 


(B.6) 


= E. r (2^ 
•'O 


iwT _ , ^T _ , T _T \ j . 

Qcx + y Qp' + X C Qcx)dt 


(B.7) 


But from the fact that the state is uncorrelated with the measurement 
error y*, and using the trace notation 

T 

J(T ,d) = tr f (Q R (t) + c'^^Qc E[x(t)x'’^(t)])dt ' (B.8) 

•'0 ^ 

Now since 

]^ = E[ (x-x) (x~x)^] = E[xx^] - X x^ 


Then 


fp ^ 1^ rr 

E[x X ] = 2^ + X X 
allowing (B.8) to be written 

J(T-.,d) = tr I [QR (t) + c'^QCZ + C^QC x x^]dt (B.9) 

But since does not depend upon an equivalent problem is 

m /*T 

J(T ) = tr C^QC / (S + x?)dt (B.IO) 

Jq 
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APPENDIX C: FLEXIBLE SPACECRAFT WITH THERMAL DEFLECTIONS 


For our present purposes LST is modeled as a central rigid body 
to which is attached appendages such as the solar panels, light shield, 
and possibly the Service Support Module (SSM) . The central rigid body, 
B, is the primary /secondary mirror assembly together wi.th either all 
or part of the SSM and its subassemblies. 



We require the rigid body B to maintain its inertial attitude in pitch 
and yaw within .005 seconds of arc over a five hour period. 

Following^^, R is the inertial position of the system mass 
center, M is the system mass, and the applied resultant force is the 
•vector ^ , see Figure C-1. 

The procedure and notation of Reference 66 will be followed 
without elementary developments. For further details the reader 
should see section III of Reference 66. The addition of thermally 

K . PAQE BLANK NuT t 
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INERTIAL 
REFERENCE, f> 


Figure C-1 . Flexible Body With Thermal Deflections. 


induced changes in the undeformed position of the appendage is the 
only fundamental difference from the development in Reference 66, 
section IIIA. The plan is to write 6N + 6 scalar equations (2N + 2 
Gibbsian vector equations) . 


4 



= 

-B 

— B 


i ..i 

F 

= m _r 

mi 

•i 

T 

= H 


rigid body B 


, i=l-N 


appendage 

(N discrete nodal bodies) 


The resulting system of scalar equations will be linearized for 
further study and truncation. Preliminary facts and definitions which 
will be useful are: (see Figure C-1) 


b == 0 A 

^ = C ^ , 0=1^ indentity matrix 

d_ 

a = Ba , B = I- e 

a b 

w ^ ^ = angular velocity of B with respect to inertial space. 

I ± a i ^ :i ^ 

a i r ,T ’i *i 

w = la} 3 = £ 

N .... 

Mg ^ + 2]^ (g. + A ^ definition) 

r=q=^c=R (cm non-accelerating) 
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Appendage Equations; 


Write, from Newton's second law, 


( 5 ^*^+ ‘A i. ^ 


But from CM definition and ^ write, 

N 

^ 2 + a^)m^ = 0 

ft. “ 

and (C.l) becomes, using (C.2), 


or 


_i X , aI, I ••i\ m !/•• i a^a 
F =m(r+^+a + £)- — 2^ (^ + £ + A + a )m 

k=l L J 


m 


N 

/••k , A*k , ..k. k 
V (£ f A + a )m 

k=l 


^ A mass of appendage. 

Now compute 

r = r + 2u3>:r + wx (a)Xr)+a)Xr 


r = u X r (linearized) 


Now 


..i ^ ^ L ^ ^ ^ V- V i 

£ ~ Jz + 2 w x^^^ + w xyo) ^£ 1 + ^£ 


-i • SA i 
£ “ £ ^ £ 


(C.l) 


(C.2) 


(C.3) 


(C.4) 


(C.5) 


(linearized) 


(C.6) 



(C.7) 


A^+ aV 2V"'x(A^ + a^) + V^x (V'"x(A'^+ /)) 

+ ^0)^ X (A^ + a^) 


°?1 , ooi 

-A + a 


Then (C.4) becomes 


(linearized) 


= (l--^) n.^ » X£ + (l-^) X£^ T+ ) 


^ D X + a^) 


The Euler equations for the appendage sub-body A are 
-pi 

T = H 


where 


•1 Hi £ i ^ fli £.i 

/pi ^ i ^ [lij ^ i ^ Ri 
“ 1= + £x£l. w+U • w 


£ i ^ Hi vi ^ r|i ^Ai 
= toxu *a)+U *0) 


*i nl • /£.b , a.iv 

H = y ’(w + w) 


(Linearized) 


a*i ae.1, a i a i aoi °°i 
0) = 03 + 0) X 03 = 0) = 3 


Hence 


« 0^ • (<I) + 3^) 


Now for matrix forms we record 


w = {b}'*' 03 

0) = {b}^ d) 
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(for small 3) 


6^ » {a^} 3^ = 


{b}'' r 


- {a}"^ = {b}^p^ (for small 3) 


°»k r iiT..K /, iT*;k 

A = la > A “ lb} A 

ook r r- ■! T*«k 

a = la } a = lb} a 


(for small A^,a^,3^) 
(for small A^,a^,3^) 


1 T i 

Now (C.8) becomes, in frame (note F = {b} F ) 


F^ = 


(, I i T- ( - m^\ i .'T i . vl , ..i 

= y.- y m tor + yi- ^ lm(tjjp + A +a 


1 N 1 ’*1 11 

m ip-b >• k , .k , ..k. k 

-TT (wp + A + a)m 


and (C.IO) becomes (note = {a^} I^{a^}~ {b}^ {b} for small 


.k k -kv 

A ,a ,3 ) 


i *L • **i 

T = j (to + 3 ) 


Now combine (C.ll), (C.12) using the definitions 


T . , T „T. 

q A (a , 3 ) 


T , ( A 2^^ 
a A ya , a , . . . ,a / 

i”^_ / i i i\ 4-11 
ot = 1 ^ 1 * ^ 2 ’ ^ 3 )’ i“ljl 

3 = \3-^ , 3 r ) 

{&{. 3^ 3^) 


i HL ^i 
T = + T 

F = + F 

3^ (interconnection moments between subbodies A^) 

— i f i i i 

F = - K a (interconnection forces between subbodies A ) 
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• torsional stiffness (3x3) , = translational stiffness 

^i ^i 

and where F , T , are externally applied forces and torques on 


(3X3) 


body A^. Rewriting (C.ll) 

i ..1 / i ..i 

m a + U-^ m a 


m N ..N ^ f i i 
-TT m a + K a 

.y" 


^i , r i I in 'I R j * 

? + m (r + p ) ^ m (r + p )Ja) 


j j X N , , 

i vi m ^ k vk 
ra A - ~j7- / . m A 


For N equations (i*l N) the matrix form can be written 

f .. f f 1^* f •• 

Ma+ Ka=f + MRoj + M A 

where 


l/, ^ ^ 2 , 

■" \-ji I ^3 

2= 2 ^ O / 2 \ 

fw* m m ^ 2/, m \ , 

^ --jrh “ 


m N ^ 

h P 

2 N ^ 

_ ^ J I A 0 

^3 ’ ^3= ^ 

! 0 


N-l N 
m m 




N 1 
m m J 

^ 3 


N 2 

m m J 
^ 3 


■ V- Ji) 


I3NX3N 


3NX3N 


RBPRODUCIBlLIl l 
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(A 

> 

0 -rj 

A- A 

1 

1 

A 

r A 

r. 0 -r. 


1 


3 1 


vN 


-r- r, 0 


3NX1 

2 1 


3X3 



Now, from (C.12), 

J 3 + 3 = T - J' w 


(C.15) 


where 


• 


- 



.1 

J, 0 


J 

s 


I 

O 

/ 

/ 

^rzS 

j'= 



3NX3N 

- 


J = inertia matrix of 
subbody about 
its mass center 


'K = 




'V 


3NX3N 


Together (C.14) and (C.15) imply 
+ Kq = L' 


(C.16) 


where 



M = 


f 


f , 

M 0 


K 0 


, K = 


1 

t-j 

2-j 

7 * 

_ 0 


L’ = ii - (L + 


(C.17) 





"f 



^ A 



-M R 


M 


/ 

6 NXI 

+J' 

m * 

6NX3 

_ 0 ^ 


For appendage equation summary, see Figure C~2. 

Now the thermally induced displacements of the appendage subbodies 
are assumed to satisfy the first order descriptions 

A » |l - A^ (C.18) 

where _ 

r"_A^ 

0 


-A(t-t) _ 




e N 

s 

\ 

N 

S 


0 




'* N — 
'--r(t-t) 


0 


0 


3NX3N 

0 

0 


-X^(t-t) 


3x3 


(>:r ■ 


ttl 

= thermal time constant associated with the k direction for 


subbody A (assumed given) 


A^ = steady state displacement due to change in temperature 
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For elements of the appendage, A^, which are assumed unaffected by 
solar Incidence, set = 0. If all such thermal effects are to be 
ignored set X^ = 0, j«l-N, and note that A = 0 = A and (16), (17) 
reduce to 

Mq + K q = d) (C.19) 


Now to obtain the expression for A needed in (G.17), differentiate 
(C.18) twice to get 



-A(t-t) 

e 


A 

o 


(C.20) 


where 


1 



X^ 0 0 

> 

ro 

II 

> 

> 

II 

a ' 

o 

1 

II 

0 X^ 0 

0 0 x|_ 


If we further assume that the thermal time constant associated with 
each modal body and each direction is the same then 

A = - X^ ^-X(t-t) ^ (C.21) 

o 

and the appendage Equations (C.16) become 


Mq + Kq = X^e'^^’^'^^A (C.22) 

o 

If the system is time invariant then the convenient choice t = 0 is 
allowed. The appendage equations are summarized in Figure C-2. 


EEPEODUCBILIIi 0- 
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Mq + Gq + Kq = A — r W + MA 



Figure C-2. Flexible Body Appendage Equations 
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System Equations; 


To obtain the system equations write the Newton, Euler equations 
F = M R 

T = H + fi (C.23) 

— —V — c 


where 

H = momentum from rotors 
H A r p X p dm vehicle momentum 

p = generic position vector from vehicle mass center 
Now omitting the details of the expansion of H and H which is 
contained in Reference 66 we may write immediately after linearizing 
(128) of Reference^^, 


T = J*0) + H +u)H + r2m^a^ + 53 


i=l i=l 


i-1 


where 

^ i ^ V, / 

Si ^ 1 


i /k CMSs 

V reaction wheels) 


(C.24) 


(C.25) 


til 

^ scalar magnitude of momentum of i reaction wheel rotor 

h^ ^ scalar magnitude of momentum of each of k CMG rotors 

i th 

^ unit vector describing direction of momentum vector of i 

CMG rotor relative to the frame 

i til 

A unit vector in direction of momentum vector of j reaction 

wheel rotor relative to frame 

* 

J ^ Inertia of vehicle (considered as a rigid body) 

i i i i 

Then since p + A p by assumption of small A , 
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(C.26) 



• • i. «. 

since for CMGs h = 0 and for reaction wheels 5 =0. Moreover, since 

o 

is=l-k is a function of the gimbal angles, a matrix S may be defined 
such that 

ip = S(6)5 ip = 3 X 1 (C.27) 

S = 3 X gk 

S = gk X 1 

where 


;p = ip^ + ip^ + — ip^ 



for k CMGs, each with g gimbals (g = 1 for single ^ 'mbal CMGs and 
g = 2 for double gimbal CMGs) . Then 



Mq + Kq = fi, + 5^ s - (C.30) 

Characterizing the Control Torques, T^ for a Reaction Wheel System. 


The control torque may be characterized in many levels of detail, 
depending upon how deeply one wishes to penetrate into the control 
hardware dynamics. The level of detail chosen must be compatible 
with the fidelity of the rest of the model. For linear system 
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characterizations such as we use here, this implies that the eigen 

values of the homogeneous appendage equation (setting right hand side 

of (C.30) equal to zero) provide a spectrum within which the "spectrum 

of control authority" (those roots associated with the generation of 

T from measurements which are yet to be specified) must lie. Thus, 
c 

erroneous information is obtained from analysis or simulation if we 

insist upon modeling modes within the control mechanisms which can 

excite structural modes of the physical system but which are absent 

from the system model (C.29), (C.30). Likewise, erroneous results are 

obtained if there are structural modes modeled in (C.30) which can 

interact with modes of the physical controller implementation but 

which modes are absent in the model for T in (C.29). Therefore, to 

c 

be completely honest about the model for T^ we must admit that T^ 
cannot be rationally described before an eigenvalue analysis is 
completed for the appendage Equations (C.30), and before some controller 
hardware is proposed. (In other words we cannot simply write T^ in 
(C.29) and proceed with the design or analysis (i.e., optimization) 
and then later design the dynamical elements which produce T^ without 
returning to check the spectrum compatibilities discussed above) . 

For the LST mode in (C.29) we have already neglected any shock 
mount or elastic suspension of the CMGs or reaction wheels. It is 
therefore inappropriate to include any appendage modes or controller 
dynamics which exceed the shock mount frequencies. The functional 
block diagram of the LST control system is shown in Figure C-3, and the 
details of each block are provided in the next section with a view toward 
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completing the system equations which presently have the form in 
(C.29), (C.30). 

Description of the Control System for LST; 

For the rate gyro transfer function we have from (see Figure C-3) , 

1 


‘=rg'=> = —J 




■ 0 (s) 


(C.31) 


(27T 16) 


2 2tt 16 


•• • 2 2 

or £i)'+2?a)a)’+a)'o3 = wa)„, which yields for each root of G (s) 

r r r r 2’ ■' rg ' 

.7 

0) = 2tt 16 
r 


I A L _ = OJ = 2tt 16 
' ry ' 1 , 2 r 

For the position sensor 

1 


(C.32) 


G„„(s) = 

gs 


8 


9' 


which yields 

1^ 1 
' gs‘ 


Fp(s) = 


— + 1 
2it 

0 

= 03 8 , 

(o) = 2 tt) 

g 

8 

2TT 


to digital 

(A/D) prefilter 

1 

03” 

® +1 
on * 

US’ 


(C.33) 


(C.34) 


(C.35) 


and the associated root 


A i= 27T 20 
P' 


applies to both prefilters for position and rate 


(C.36) 


03” 

+ 

03 

3 

3 

II 

3 



rp 

rp 

6” 

+ 

03 

CD 

3 

II 



rp 

rp 


(C.37) 
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Figure C-3. 






The reaction wheel subsystem is illustrated in block diagram 
form in Figure C-4 . Neglecting the friction and the amplifier non- 
linearities, the governing equations are, for each reaction wheel 


j=l,2,3,. . . V (let V = 4) . 

Ei(s) = -h - Rp ip^ 

a 

EzCs) = +K^ +K^E^(s) 

E^(s) 




h. 

L_ 

dr, S 
ROJ 


h, 


= - K i. 
m f 


(C.38) 


where h^ is the angular momentum (magnitude) of the j rotor. 

The following is a qualitative argument for studying the linearized 
equations for the reactioxi wheel. Some noise (which is always present 
in the physical circuitry even though it is not modeled yet) will tend 
to "smooth" a threshold type of nonlinearity as can be illustrated by 
the use of the dual input describing function (DIDF) . Thus, the 
amplifier threshold and stiction of the wheel are neglected in this 
model. The elasticity of the bearing/shaft interface allows for 
linear motion in the small (in the neighborhood of zero wheel speed) . 

For the more frequent non-zero wheel speed circumstance, the friction 
plays less of a role in the dynamics . Based upon these qualitative 
arguments, linearity of the reaction wheel model is therefore assumed. 
Quantitative arguments must await further test data. 
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h! is the conimanded rate of change of the angular momentum of 
J 

the j * rotor. In differential equation form the reaction wheel model 
(C.38) becomes 



h, = - K i^ 

3 m f 

Or, for the four reaction wheel systems 

K 


dt 


'R + K,R , 

T A F \ B 

, -^4 J^„L 
m / RW m 


-K I, 
m 4 


+ 


— I 
L 4 
m 


(C.39) 


h’ 


(C.40) 




where 


/\\ 




/M 


IK\ 

^2 

> i 4 


0 , = 

^2 

Vi ' — 


“3 

1 ^f3 

^3 

J n - 


\\l 


Vui 




[hi 


A identity matrix 

where we have assumed the same hardware parameter yalues for each of 
the four reaction wheel subsystems. 

The middle term on the right hand side of (C. 29) may be written 




i=l 
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(C.41) 



where 


■"s . Y^ . Y^l 


h A (h^,h2»h2,h^) 


S = 

• _** -'• 


S h 


(C.42) 


Likewise the next to last term in (C.29) may be written 
4 

L 

i=l 

For the rotor mounting arrangement for LST, 


S = • 


The system equations, thus far, may be written, neglecting the CMGs 
in (C.29) , 


-.342 

-.342 

.342 

-.342“ 


.664 

.664 

.664 

.664 

(C.43) 

.664 

,664 

-.664 

-.664. 



J*(L = T - ’^Sh - W^Sh - Tq 


• f ** T • 

Mq + Dq + Kq = i A - F W 


to! + 25 0 ) (I)' 4- 

2 r r 2 2 r 

• ^ 

to! + 25 (0 (o' + (o' to 

3 r r 3 3 t 


2 

= (0 to„ 
r 2 

2 

= to to_ 

r 3 


9A 

+ (JO 6! 

= to 9, 

2 

g 2 

g ' 

e; 

+ to 9! 

= to 9, 

3 

g 3 

g ■ 


+ 0 ) to" 

= to ( 

2 

rp 2 

rp 


+ to to'! 

= to ( 

3 

rp 3 

rp 

6'' 

+ (0 9!! 

= to 1 

2 

gp 2 

gp 

6'^ 

+ to 9" 

= to 

3 

gp 3 

gp 


system (rigid body + appendage) 
appendage 

rate gyros 
position sensors 


prefilters, F 


= - K i 
m 

= - p^i+ p^ h - p^ 


1 


reaction wheels 


The collection of these equations is numbered (G.44). 
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External Disturbances ; 


The external disturbances acting on LST, in approximate order of 
their importance, include: 

1. gravity gradient 

2 . aerodynamic 

3 . magnetic 

4 . thermal 

5 . solar pressure 

6 . micrometeorites 

Those internal effects which have not been included in the model thus 
far include disturbing effects from: 

1. rotating machinery (vibrations from unbalanced rotors) 

2. sensor and electronic noise 

3. equipment motion (pumps, shutters, etc.) 

4. instrumentation anomalies (bias errors in instrument 
alignment and amplifiers) 

The first four of the external disturbance effects listed above are 
considered important for LST . The thermal effects (4) have been 
considered in the derivation of the flexible spacecraft model (C.44). 
The first three, therefore, remain to be described. 

Gravity Gradient Torques : 

Derivations of gravity gradient torques are found in several 
sources, but the expressions in Reference 67 will be most useful here. 
From Equation (B.22) in Reference 67, for circular orbit, 

= 3^2^ i3 X JU* i3 (C.45) 
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where 


^ z 
0 = !-'§■ 


N = 


9 = 


•CHt 

0 

r 0 

0, 


L 


]b = 0 ^ 


“Sn sn^ 

-cn 

X t 

X 

cn 

-sn 

X 

X 

sn cn^ 

cn 

X t 

X 


g = N 0^ b 


tJ 


- 0 , 

0 

9, 


®2-' 


-e. 


(C.46) 


= angle between orbital plane and sum line 

n^ = vehicle orbital position measured from orbital midnight 

(n^ = t for circular orbit) 

^ = orbital rate 

o 

iJ = inertia dyadic of vehicle (assume rigid) 

{b}’^ j {b} 


We wish to obtain an expression for the torques in _b coordinates and 
linearized about 9=0. 

T® = T® 

T® « T® + G® 9 (C.47) 

To proceed, define from (C.46) 

T 

n„ A (sn^, sn cn^, cn cn^) 

3 = t X t ■ X t 

A n^ 0^ = n^[I + '0] (C.48) 

Then (C.45) may be written in ^ coordinates as 

T® = ^ J q (C.49) 
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But from (C.48), (C.49) becomes 


T® = 3Q^ ([l-0]n_r J[I-e]n. 
o J J 


» 3Sil 1*3 J 113 + S 3 J n 3 0 - ( e n 3 )'’j n 3 | 
discarding second order terms in 9. A useful identity 

(On^)*^ = n^ 6 ^ - 9 n^ 
gives (C.50) the form 


T® = 




2 Jn^+ n^ J n^ 6 - J® + >^3 ‘^’^ 3 ®] 


Hence, comparing (C.51) to (C.47) 

T® = 3^^ n^ J n^ 

{'^3 ^ ^^3 ’^3 


o * o 3 
G® = 3f2' 


For the case 


n » 45' 

X 


J = 


T® in (C.52) becomes 
o 


Q = .00111 rad/s 
o 


18,357 0 0 

0 41,822 0 

0 0 44,484 


Kg m‘ 


(C.50) 


(C.51) 


(C.52) 
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/^3„ ^'^33" '^22A 


T® = 3 (.00111)' 
o 


^2 ^3 

"^3 


/ 




= 3fi 


^3i ’^32 ^‘^22" “^ll^ 
I (1+ c2S2^0 s n^cn^ 


= 3J2“ 


/ 2 
' A-- sn cn c S7 t 
32 X X o 


A cn — s2 fi t 
13 X 2 o 


\ 


A«- sn — s2 t 
21 X 2 o 


s2fi t cn 
2 o X 


V 


3fi 


13 


1 s2fi t sn A-, 

2 o X 21 


" sn cn A_„ 
X X 32 

0 


/ ^ \ 

0 

cn A 
X 13 

0 

1 s 2^2 t 1 

1 ° 1 

0 

sn A 

X 21 

0 

\c 2Q t J 
\ ° / 


Using the above data. 


o 


G® = 


.002 

0 .002 



1 \ 




0 

-.034 0 


s 

2f2 t 
o 

n ,m. 


(C.53) 

0 

.031 0 



2fi t] 




- 

- 


\ 





r 0 

.0035s2n 


" • 

0035s2n 

"1 




t 



t 




.034s2n 

,031s2n 


-.089 + .059c2n 
t t 


-.024 - .024 c2n 


-.022 - .022c2n -.022 - .065 c2n 

t t t. 


(C.54) 


where for circular orbit n^ 0 , t. Note that due to the small angles 

to ^ 

0^ the term T® in (C.47) is more significant in magnitude than is the 

G® 0 term. 

Magnetic Torques: 

An approximation of the geomagnetic potential, taken from 
Equation (6) of Reference 68, is 
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,53PEODUCrolLlW ' 
iBiSINAL ^ 


E ( 

where from Figure 4, p. 17 of Reference 68, 

r = radius of earth 
o 

R = radius of orbit 

X = 1/2 - X 

X = latitude 

(j) = easterly longitude 

, h™ = Schmidt coefficients (see Table 1 in Reference 68) 

“n n 

P™(X) = Legendre functions with Schmidt normalization 
n 

The magnetic field in a region containing no sources satisfies 

B “ - V V (C.56) 


V « r 


n=l 




n+1 


(g"* cos m(j) 4- h^ sin mc())p"'(X-) 
n n ^ n 


(C.55) 


and the torque applied to the spacecraft is 


t“ = M X B 


B = {b> B, 


M = {b} 




(C.57) 


where M is the magnetic field generated within the spacecraft. Now 
we wish to truncate the series for v and express in the vehicle 
coordinate frame, ]b. To relate X and (|) to the n and n^ of previous 
sections define the coordinate frame ii as in Figure C-5. 

The relation between the ^ frame and the spherical coordinate 
frame ^ in the figure is 


s = 


i = B i 


(C.58) 
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Now the magnetic field in coordinates is, from Equation (8) in 


Reference 68, with changes in notation. 




/ 1 9v 
1 ^9X 

\ 

h 


1 

9v 


Rsin X 

9(j) 

Vi 

s 

[ 9v 

\ 9r 

/ 


where 


(C.59) 


X A ir/2 - X 


B . {s}’' B = {b}’^ B. 
~ — s — b 


Now since from Reference 67, p. A-5 
i = Q b , 
where 


Ct 

-sxst 

-cxst 

0 

cx 

- sx 

st 

sxct 

cxct 


(C.60) 


[I + e] 


From (C.58), (C.60), 

£ = ¥ Q b 

s3ct -sBsxst -b c3cx 


BQ = 


-c3ct c3sxst + s3cx 
. St sxct 


(C.61) 


-s3cxst - c3sx 
c3cxst -s3sx 
cxct 


[I + 6] 


Thus 


B = B Q B, 
s ^ b 


T — T 

B, = Q B^ B 
b . £ 


(C.62) 
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Since from Equation (11) of Reference 68, 


3/ 


B = - g° i-° 
s ^1 \ R 


sin X 
0 

^ cos X 


for a simple dipole model of the magnetic field (only the first term 
of the spherical harmonic expansion in (C.55) is needed). From 
Reference 69 . 

g° = - 30425 gauss 

r = 6371.2 X 10^ m 
o 

The angle X can be related to n^ from the geometry of Figure C-5 
using spherical trigonometry. 


sin(n - IT + n^) 

y t 


sin X _ sin(TT/2 - X) _ cos X 
sin 3 sin3 ” sin 3 


Then by the sum of angles indentity 


cos X = sin 3(sin(n - 7T)cos n^+ cos(n - 7r)sin n ) 

y t y t 


sin A 




2t 

cos a 


(since 0 < A < tt) 


Then 


B 

s n \r 


It 

0 


2 2 2 2 2 

1-s 3(S ^c^ + cf s: + 2S c 

y-TT t 1-TT t y-TT y 


S c )1 
-TT t t J 


l/2\ 


(C.63) 


y2.S3(S(n - 7T)Ct + C(n - 7r)St) 


and from (C.62) 
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(i) <s ®' 


/[l-S^3(S^(n -7T)C^+C^(n -7r)S^+2S(n -TT)c(ny->7T)S^C^) ] 


i2S3(s(lI^~ 7 T)c^ + 7T)St) 


(C.64) 

By Inspection of (C.57) and (C.64) it is apparent that is not 
simply sinusoidal in time and that the linearized form of the magnetic 
torques 

B^« t“ + G™0 

will yield terms in T™ and g"* which will, if expanded in a Fourier 
series, contain higher harmonics of the orbital frequency. Specifically 
it can be shown from (C.61), (C.64) that 


t" - - q? ? X' 


(C.65) 


- - 4 (/)' (^') = r 


(C .66) 


Equation (C.65) could now be expanded into a Fourier series in time in 
order to obtain the coefficients of the various harmonics of orbital 
frequency. Of specific interest will be the first three or four 
harmonics 


t”' = sin n + cos n + T® sin 2n^ 

o oo 01s t 01c t 02s t 


(C.67) 


+ cos 2 n + T® sin 3 n.+ T® cos 3 n + 

02c t 03s t 03c t 
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The magnitude of the magnetic field assumed for the LST spacecraft is 
100 Gauss. The first four harmonics of T™(t) have been approximated 
for use in the text as 

t“ = .003(sin .OOlllt + sin .00222t + sin .00333t + sin .00444t) 
o (C.68) 

Finally, the linearized disturbances from gravity gradient and 
magnetic torques can be written (aerodynamic torques have been con- 
sidered only to add some small amplitude to the Fourier terms com- 
prising gravity gradient and magnetic torques.) 


T = (T^ + T™) + [G® + g”^]9 = T + G6 (C.69) 

o o o 

so that the total system equations may now be written, substituting 
9 for 0) in the linearized equations 

J*9 + Fq = - ’^Sh + (^Sh)'^9 + G9 + T^(t) }rigid body + appendage 

Mq + Kq + r'^ 9 = £#^A(t) }appendage 

h = -K ^ 

^ } reaction wheels 

i = -p^i + p^h - h’ 


65' 

+ 2g Ui 

, 2 
+ 00 (X 

2 

r r 2 

r 

to' 

+ 2? 00 tol 

, 2 
+ OO 0 

3 

^r r 3 

r 

9' 

+ 00 91 = 

( 

CD 

3 

2 

g 2 

g ^ 

01 

+ 00 01 = 

( 

CD 

3 

3 

g 3 

g ^ 

to" 

+ 00 to" = 

00 ool 

2 

rp 2 

rp 2 

ti)'' 

+ 00 to'l = 

00 (Ol 

3 

rp 3 

rp 3 

Q'o 

+ 00 9'4 = 

- C 

CD 

3 

2 

gP 2 

gp / 


+ 00 9'' = 

( 

CD 

3 

3 

gP 3 

gp J 


2 • 

9 

r 3 


}rate gyro 


(C,?0) 


}positlon sensor 


}Prefilter rate signal 


}pref liter, position signal 
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where external forces and torques on the appendage are neglected 

(£ = 0 ) . 


To proceed to study this system of equations it is helpful to 
first make some coordinate transformations to further simplify its 
structure. Even if the equations were time invariant, an eigenvalue 
analysis of these original equations might be too large a task for 
the computer. Thus, two steps may be taken to simplify the task. 
First, replace the time varying G with its value averaged over one 
orbit. (It can be seen in the derivations of the external disturb- 
ances that the G0 term is smaller than the T term in (C.69) if 6 is 

o 

small). Also, long experiments (5 hours) are required by LST and the 
averaging of G over one orbit («sl.5 hours) seems acceptable. T^(t) , 
however is not to be averaged, since its time dependence may be 
important to the control problem. 

A further simplification is available if the cross product 
between vehicle rate and rotor momentum is neglected. In this event 
the term (^Sh)~6 in (C.70) is discarded. This step is more justified 
if the nominal momentum is zero. 

The appendage equations can be transformed to distributed or 
"modal” coordinates, n , by the choice 

q = $T) (C.71) 

where 5 is the matrix of assumed independent eigenvectors satisfying 
the orthonormality relationships (see Reference 66 for further 
discussion) . 
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$ = 


M cj)^ = 1 

k=l 

M = 0 


-e- 

C_l. 

II 

O 



^2 


,V 


Then the appendage equation takes on the form, 

multiplying (C.30) by 

• 2 'T <p.. T f •• 

Ti + 2 Ccjr] + CT r| + ^r 0 = $ mA(t) 


using (C.71) and pre- 
(C.72) 


where modal damping C, is added in this form. 

Now the first two equations of (C.70) may be written 



Solving for the highest derivatives 



where 


Sh 0 

✓s— 1 

“g o" 


0 -2?a 

. 1-2 A J 

1 

0 

1 

Q 


- 





L3 A 

1 PsK 

ri~l m 

L 0 , 

’ 1 

s A 

■"3 

_o 5^. 


/s 

J A 

r * 1 

j r $ 

L$ r v_ 

. 3-1 . 


-f"^ r 

I + 

$ 

T -1 

■ F r 


(C.73) 


(C.74) 


(C.75) 
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Now define the state vector x^, the control vector u°, the output 
vector, y^, and the measurement vector, z^. 




II 

62 


62 

®3 


0 )' 


0)1 


0); 


0); 


0 ) 


.2 


0 ). 


0 

-D. 

6 

n 


93 " 
/.je 


23 ’ 




23 ' 

0 ) 


23* , 

0 )’ 


0 

.D. 

0 

n 


, u A h' 


y-^ A (62. 63. 63, 63) 


z^ A ( 0 '^, 6!J, 0)'^. 0)”) 


(G.76) 


where 


h. A h - h 
A = 

h = current value of h(t) at time of update of linearized 
model 


Then the linear state equations may be written 



.1 .1 1 ^ O ^ 

x=Ax+Bu + 
1 pi 1 

y = C X 

1 vfl 1 
z = M X 


1 

L w 


(C.77) 


where 


A 


23.” 23., 23 ” 23 ' 

0 6* oj w 


-0) I. 0) I. 0 0 

gp 2 gp 2 

0 -03 I- 0 0 

g 2 


0 

0 

0 

0 


0 

0 


0 -03 I. U3 I_ 

rp 2 rp 2 


0 

0 

0 
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0 
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0 

0 


0 

VI 2 

0 

0 


23* ' 

03 


0 

0 


0 

I, 


23 • 23* • 

01 0 n 01 9 n h^i 


00000 00 


0 03 1^0 0 0 0 0 

g 2 


0 0 0 0 0 0 0 


00000 000 


-2?^03_I^ 0 0 0 0 03^1^ 0 0 0 


’r r 2 
0 


^3+v ° ° 


0 L, 


0 0 0~K I,, 

m. -y 


0 Pol/. -P-,1 


23 q ” 


233 , 


23 " 

03 

23 , 

03 

23*, 

03 ’ 

0 

n 


A 


'2"4 " 

(C.78) 


b" A 


~ 0 


1 

0 

0 

0 


0 0 

0 


0 0 

0 


0 0 

0 

, A 

0 0 

0 


0 0 

0 



0 


0 0 

-P 3 I 4 

•mm 


0 P 2 


w = 



(C.79) 
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Steps are outlined in section 6.0 of the text to reduce such a model 
for control design. 
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